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CONFIDENTIAL 

>CPN100638 putative 98 kDa outer membrane protein 

MKSSLHWFLISSSLALPLSLNFSAFAAVVEINLGPTNSFSGPGTYTPPAQTTNADGTIYNLTGDVSITNAGSPTA 

LTASCFKETTGlsnL,SFCK3HGYQFLLQNIDAGANCTFTNTAANKLLSFSGFSYLSLIQTTNATTGTGAIKSTGACSI 

QSNySCYFGQNFS^TONGGALQGSSISLSLNP^^LTFAK^IKATQKGGALYSTGGITINNTLNSASF 

lYTEASSFISSNKAISFINNSVTATSATGGAIYCSSTSAPKPVLTLSDNGELNFIGNTAITSGGAIYTDNLVLSS 

GG PTLFKNNSGYDTAAPLGGAIAIADSGSLSLSALGGDITFEGNTVVKGASSSQTTTRNSINIGimJAKIVQ 

S QGNTI YFYDP ITTS I TAALS DAIJSJLNG PDLAGNP A YQGT I VFS GEKLS EAEAAEAD^^LK S T I QQ PLTLA^ 

LKSGVTLVAKSFSQSPGSTLLMDAGTTLETADGITINNLVLNVDSLKETKKGTLKATQASQTVTLSGSLSLVD^ 

GWYEDVSVmNPQVFSCLTLTADDPANIHITDLAADPLEKNPIHWGYQGI^ALSWQEOTATKSKAATLTWTKT 

NPNPERRGTLVANTLWGSFVDVRSIQQLVATKVRQSQETRGIWCEGISNFFHKDSTKINKGFRHISAGYWGATT 

TLASDNLITAAFCQLFGKDRDHFINKNRASAYAASLHLQHLATLSSPSLLRYLPGSESEQPVLFDAQISYIYSKN 

TMKTYYTQAPKGESSWYNTCCALELASSLPHTALSHEGLFHAYFPFIKVEASYIHQDSFKERNTTLVRSFDSGDL 

IWSVPIGITFERFSRNERASYEATVIYVADVYRKNPDCTTALLINNTSWKTTGTNLSRQAGIGRAGIFY^^ 

LEVTSNLSMEIRGSSRSYNADLGGKFQF 



CODING SEQUENCE 

THE SEQUENCE IS ENCODED ON THE NEGATIVE STRAND 
The ATG is presumably the start codon 
The TAA is presumably the stop codon 

1 TGTAGATTCT TAACTTACTA GTCTCTCCTT TCCTCTTGCT TTCTTTAATT 

51 TATTGCAGTA TGTGGTGAAA TAATTTGTTA AACCACCTAT AGCCCTCTAC 

101 ATG AAATCCT CTCTTCATTG GTTTTTAATC TCGTCATCTT TAGCACTTCC 

151 CTTGTCACTA AATTTCTCTG CGTTTGCTGC TGTTGTTGAA ATCAATCTAG 

2 01 GACCTACCAA TAGCTTCTCT GGACCAGGAA CCTACACTCC TCCAGCCCAA 

2 51 ACAACAAATG CAGATGGAAC TATCTATAAT CTAACAGCCC ATGTCTCAAT 

3 01 CACCAATGCA GGATCTCCGA CAGCTCTAAC CGCTTCCTGC TTTAAAGAAA 

3 51 CTACTGGGAA TCTTTCTTTC CAAGGCCACG GCTACCAATT TCTCCTACAA 
401 AATATCGATG CGGGAGCGAA CTGTACCTTT ACCAATACAG CTGCAAATAA 

4 51 GCTTCTCTCC TTTTCAGGAT TCTCCTATTT GTCACTAATA CAAACCACGA 
501 ATGCTACCAC AGGAACAGGA GCCATCAAGT CCACAGGAGC TTGTTCTATT 



551 CAGTCGAACT ATAGTTGCTA CTTTGGCCAA AACTTTTCTA ATGACAATGG 

6 01 AGGCGCCCTC CAAGGCAGCT CTATCAGTCT ATCGCTAAAC CCCAACCTAA 

651 CGTTTGCCAA AAACAAAGCA ACGCAAAAAG GGGGTGCCCT CTATTCCACG 

701 GGAGGGATTA CAATTAACAA TACGTTAAAC TCAGCATCAT TTTCTGAAAA 




751 TACCGCGGCG AACAATGGCG GAGCCATTTA CACGGAAGCT AGCAGTTTTA 



8 01 TTAGCAGCAA CAAAGCAATT ACCTTTATAA ACAATAGTGT GACCGCAACC 

851 TCAGCTACAG GGGGAGCCAT TTACTGTAGT AGTACATCAG CCCCCAAACC 

901 AGTCTTAACT CTATCAGACA ACGGGGAACT GAACTTTATA GGAAATACAG 

951 CAATTACTAG TGGTGGGGCG ATTTATACTG ACAATCTAGT TCTTTCTTCT 

1001 GGAGGACCTA CGCTTTTTAA AAACAACTCT GGCTATGATA CTGCAGCTCC 

1051 CTTAGGAGGA GCAATTGCGA TTGCTGACTC TGGATCTTTG AGTCTTTCGG 

1101 CTCTTGGTGG AGACATCACT TTTGAAGGAA ACACAGTAGT CAAAGGAGCT 

1151 TCTTCGAGTC AGACCACTAC CAGAAATTCT ATTAACATCG GAAACACCAA 

1201 TGCTAAGATT GTACAGCTGC GAGCCTCTCA AGGCAATACT ATCTACTTCT 

1251 ATGATCCTAT AACAACTAGC ATCACTGCAG CTCTCTCAGA TGCTCTAAAC 

13 01 TTAAATGGTC CTGACCTTGC AGGGAATCCT GCATATCAAG GAACCATCGT 

13 51 ATTTTCTGGA GAGAAGCTCT CGGAAGCAGA AGCTGCAGAA GCTGATAATC 

1401 TCAAATCTAC AATTCAGCAA CCTCTAACTC TTGCGGGAGG GCAACTCTCT 

1451 CTTAAATCAG GAGTCACTCT AGTTGCTAAG TCCTTTTCGC AATCTCCGGG 

1501 CTCTACCCTC CTCATGGATG CAGGGACCAC ATTAGAAACC GCTGATGGGA 

1551 TCACTATCAA TAATCTTGTT CTCAATGTAG ATTCCTTAAA AGAGACCAAG 

1601 AAGGGCACGC TAAAAGCAAC ACAAGCAAGT CAGACAGTCA CTTTATCTGG 

1651 ATCGCTCTCT CTTGTAGATC CTTCTGGAAA TGTCTACGAA GATGTCTCTT 

1701 GGAATAACCC TCAAGTCTTT TCTTGTCTCA CTCTTACTGC TGACGACCCC 

1751 GCGAATATTC ACATCACAGA CTTAGCTGCT GATCCCCTAG AAAAAAATCC 

1801 TATCCATTGG GGATACCAAG GGAATTGGGC ATTATCTTGG CAAGAGGATA 

1851 CTGCGACTAA ATCCAAAGCA GCGACTCTTA CCTGGACAAA AACAGGATAC 

1901 AATCCGAATC CTGAGCGTCG TGGAACCTTA GTTGCTAACA CGCTATGGGG 

1951 ATCCTTTGTT GATGTGCGCT CCATACAACA GCTTGTAGCC ACTAAAGTAC 

2 001 GCCAATCTCA AGAAACTCGC GGCATCTGGT GTGAAGGGAT CTCGAACTTC 

2 051 TTCCATAAAG ATAGCACGAA GATAAATAAA GGTTTTCGCC ACATAAGTGC 

2101 AGGTTATGTT GTAGGAGCGA CTACAACATT AGCTTCTGAT AATCTTATCA 

2151 CTGCAGCCTT CTGCCAATTA TTCGGGAAAG ATAGAGATCA CTTTATAAAT 

2 2 01 AAAAATAGAG CTTCTGCCTA TGCAGCTTCT CTCCATCTCC AGCATCTAGC 



2251 GACCTTGTCT TCTCCTW^GCT TGTTACGCTA CCTTCCTGGA TCTGAAAGTG 

23 01 AGCAGCCTGT CCTCTTTGAT GCTCAGATCA GCTATATCTA TAGTAAAAAT 

23 51 ACTATGAAAA CCTATTACAC CCAAGCACCA AAGGGAGAGA GCTCGTGGTA 

2401 TAATGACGGT TGCGCTCTGG AACTTGCGAG CTCCCTACCA CACACTGCTT 

2451 TAAGCCATGA GGGTCTCTTC CACGCGTATT TTCCTTTCAT CAAAGTAGAA 

2501 GCTTCGTACA TACACCAAGA TAGCTTCAAA GAACGTAATA CTACCITGGT 

2551 ACGATCTTTC GATAGCGGTG ATTTAATTAA CGTCTCTGTG CCTATTGGAA 

2601 TTACCTTCGA GAGATTCTCG AGAAACGAGC GTGCGTCTTA CGAAGCTACT 

2651 GTCATCTACG TTGCCGATGT CTATCGTAAG AATCCTGACT GCACGACAGC 

2701 TCTCCTAATC AACAATACCT CGTGGAAAAC TACAGGAACG AATCTCTCAA 

2751 GACAAGCTGG TATCGGAAGA GCAGGGATCT TTTATGCCTT CTCTCCAAAT 

2 801 CTTGAGGTCA CAAGTAACCT ATCTATGGAA ATTCGTGGAT CTTCACGCAG 

2 851 CTACAATGCA GATCTTGGAG GTAAGTTCCA GTTC TAA AAG CG'1"I'CCTGA'I' 

2901 CCCTTAGAAA TTCTAAGAGA TCCTGAGTGT ATCTAGGGAC TTCTCAAAGA 



SEQUENCE ALIGNMENT 



101 ATGAAATCCTCTCTTC ATTGGTTTTTAATCTCGTC ATCTTTAGC ACTTCC 150 

NIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

1 MetLysSerSeriieuHisTrpPheLeuXleserSerSerLeuAlaLeuPr 17 
151 CTTGTCACTAAATTTCTCTGCGTTTGCTGCTGTTGTTGAAATCAATCTAG 2 00 

IIIIIMIIIIIIIIIIIIIMIIIIIMIMIIlMlllllllllMM 

18 oLeuSerLeuAsnPheSerAlaPheAlaAlaValValGluIleAsnLeuG 3 4 
201 GACCTACCAATAGCTTCTCTGGACCAGGAACCTACACTCCTCCAGCCCAA 2 50 

IIIIIIIIIIIIIIMIMIIMIIMIIIIIIIIIMIMIIMIIIII 

3 5 lyProThrAsnSerPheSerGlyProGlyThrTyrThrProProAlaGln 50 
251 ACAACAAATGCAGATGGAACTATCTATAATCTAACAGGGGATGTCTCAAT 300 

IMIIMIIIIMIIIIIMMIIIIIIIIIIIIIIIMMIIIIIIIII 

51 ThrThrAsnAlaAspGlyThrlleTyrAsnLreuThrGlyAspValSerll 67 
301 C ACCAATGC AGGATCTCCGACAGCTCTAACCGCTTCCTGCTTTAAAGAAA 3 50 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

68 eThrAsnAlaGlySerProThrAlaLeuThrAlaSerCysPheLysGluT 84 
3 51 CTACTGGGAATCTTTCTTTCCAAGGCCACGGCTACCAATTTCTCCTACAA 400 

IIIIMIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIMII 

85 hrThrGlyAsnLeuSerPheGlnGlyHisGlyTyrGlnPheLeuLeuGln 100 
401 AATATCGATGCGGGAGCGAACTGTACCTTTACCAATACAGCTGCAAATAA 450 

IIIIMIMIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIII 

101 AsnTleAspAlaGlyAlaAsnCysThrPheThrAenTlirAlaAloLAenLy 117 
451 GCTTCTCTCCTTTTCAGGATTCTCCTATTTGTCACTAATACAAACCACGA 500 

MIIIMMIIIMIIIIIIIIIIMIMIMIIIIIIMIIIIIIIIII 

118 sLreuLeuSerPiieSerGlyPJieSerTyrLeuSerLeuIleGlnThrThrA 134 
501 ATGCTACCACAGGAACAGGAGCCATCAAGTCCACAGGAGCTTGTTCTATT 550 

IIIINIIIMIIIMIIIIIIIIIIMIIMIIIIIIMIIIIIIIMI 

13 5 snAlaThrThrGlyThrGlyAlalleLysSerThrGlyAlaCysSerlle 150 
551 CAGTCGAACTATAGTTGCTACTTTGGCCAAAACTTTTCTAATGACAATGG 600 

IMIIIIIMIIIIIMMIIIIIIIIIIIIIIIMIIMIIIIMMII 

151 GlnSerAsnTyrSerCysTyrPheGlyGlnAsnPheSerAsnAspAsnGl 167 
601 AGGCGCCCTCCAAGGCAGCTCTATCAGTCTATCGCTAAACCCCAACCTAA 650 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIlllll 

168 yGlyAlaLeuGlnGlySerSerlleSerLeuSerLeuAsnProAsnLeuT 184 
6 51 CGTTTGCCAAAAACAAAGCAACGCAAAAAGGGGGTGCCCTCTATTCCACG 700 

IIMIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIMMIIIIIIM 

185 hrPheAlaLysAsnLysAlaThrGlnLysGlyGlyAlaLeuTyrSerThr 200 
701 GGAGGGATTACAATTAACAATACGTTAAACTCAGCATCATTTTCTGAAAA 750 

lllllllllllllllllllllllllllllllliiiillllllllllllll 

2 01 GlyGlylleThrlleAsnAsnThrLeuAsnSerAlaSerPheSerGluAs 217 
751 TACCGCGGCGAACAATGGCGGAGCCATTTACACGGAAGCTAGCAGTTTTA 800 

IIIIIIMIIIIIIIIIIIIIIIIMIMIIMIIIIIIIIIIIIIIIM 

218 nThrAlaAlaAsnAsnGlyClyAlalleTyrThrGluAlaSerSGrPhel 23 4 



801 TTAGCAGCAACAAAGCAATTAGCTTTATAAACAATAGTGTGACCGCAACC 850 

MIIIMIIMIIIIIMIIIIIIMIMIIMIMIIIIIIMMIIII 

23 5 leSerSerAsnLysAlalleSerPhelleAsnAsnSerValThrAlaThr 250 
851 TCAGCTACAGGGGGAGCCATTTACTGTAGTAGTACATCAGCCCCCAAACC 900 

IMIIIIIIMMMIIIMIIIIIIIIIIIIIIIIIIIIMIIMIIII 

2 51 SerAlaThrGlyGlyAlalleTyrCysSerSerThrSerAlaProLysPr 2 67 
901 AGTCTTAACTCTATCAGACAACGGGGAACTGAACTTTATAGGAAATACAG 950 

IIIIIMIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMII 

268 ovallieuThrLeuSerAspAsnGlyGlxiLeuAsnPhelleGlyAsnThrA 2 84 
951 C AATTACTAGTGGTGGGGCGATTTATACTGAC AATCT AGTTCTTTC TTCT 1000 

IIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIMM 

285 lalleThrSerGlyGlyAlalleTyrThrAspAsnLeuValLeuSerSer 3 00 
1001 GGAGGACCTACGCTTTTTAAAAACAACTCTGGCTATGATACTGCAGCTCC 1050 

IIMIMIIIMIMIIIIIIIIIIIIIIMMIIIIIIIIIMIMIII 

301 GlyGlyProThrLeuPheLysAsnAsnSerGlyTyrAspThrAlaAlaPr 317 
1 n R n CTTAGGAGGAQC AATTGCGATTGCTGACTCTGGATCTTTGAGTCTTTCGG 1100 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

318 oLeuGlyGlyAlalleAlalleAlaAspSerGlySerLeuSerLeuSerA 3 34 
1101 CTCTTGGTGGAGACATCACTTTTGAAGGAAACACAGTAGTCAAAGGAGCT 1150 

MIIIIIIIIIIMIIIIIIIIMIIIIIIIIMIIIIIIIIIIMIIII 

335 laLeuGlyGlyAspIleThrPheGluGlyAsnThrValValLysGlyAla 350 
1151 TCTTCGAGTCAGACCACTACCAGAAATTCTATTAACATCGGAAACACCAA 1200 

IIMIIIIMMIIIIIIIIMMIIIIIIIIIINIIIIIMIIIIIII 

351 SerSerSerGlnThrThrThrArgAsnSerlleAsnlleGlyAsnThrAs 367 

12 01 TGCT AAGATTGTAC AGCTGCGAGCCTCTC AAGGC AATACTATCTACTTCT 1250 

IIIIIIIMIIIIIIIIMIMIIIIMIIIIIIIIIIIIIIIIIIIIII 

3 68 DAlaLysTleValGlnLeuArgAlaSerGlnGlyAsnThrlleTyrPheT 3 84 
1251 ATGATCCTATAAC AACTAGC ATC ACTGC AGCTCTCTC AGATGCTCTAAAC 13 00 

IIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIMI 

385 yrAspProIleThrThrSerlleThrAlaAlaLeuSerAspAlaLeuAsn 400 

13 01 TTAAATGGTCCTGACCTTGCAGGGAATCCTGCATATCAAGGAACCATCGT 1350 

IIIMIIIIIIIMIIIIIMIIIIIIIIIMIIIIMMIIIIIMIII 

401 LeuAsnGlyProAspLeuAlaGlyAsnProAlaTyrGlnGlyThrlleVa 417 

13 51 ATTTTCTGGAGAGAAGCTCTCGGAAGCAGAAGCTGCAGAAGCTGATAATC 1400 

IIMIIIIMMIIIMMIMMIIIIIIIIMIIIMIIMIIIIIII 

418 iPheSerGlyGluLysLeuSerGluAlaGluAlaAlaGluAlaAspAsnL 434 

14 01 TO AAATCTAC AATTC AGCAACCTCTAACTCTTGCGCG ACCGC AACTCTCT 14 50 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIII 

43 5 euLysSerThrlleGlnGlnProLeuThrLeuAlaGlyGlyGlnLeuSer 450 



1451 
451 



CTTAAATCAGGAGTCACTCTAGTTGCTAAGTCCTTTTCGCAATCTCCGGG 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMillllllllllllll 

LeuLysSerGlyValThrLeuValAlaLysSerPheSerGlnSerProGl 



1500 



467 



1501 CTCTACCCTCCTCATGGATGCAGGGACCACATTAGAAACCGCTGATGGGA 1550 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

468 ySerThrLeuLeuMetAspAlaGlyThrThrLeuGluThrAlaAspGlyl 484 

1551 TC ACTATC AATAATCTTGTTCTC AATGTAGATTCCTTAAAAGAGACC AAG 1600 

llllllllllllllllllllllllllllllllllllllllllllllllll 
485 leThrlleAsnAsnLeuValLeuAsnValAspSerLeuLysGluThrLys 500 

1601 AAGGGCACGCTAAAAGCAACACAAGCAAGTCAGACAGTCACTTTATCTGG 1650 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIMIIIIIIIIII 
501 LysGlyThrLeuLysAlaThrGlnAlaSerGlnThrValThrLeuSerGl 517 

1651 ATCGCTCTCTCTTGTAGATCCTTCTGGAAATGTCTACGAAGATGTCTCTT 1700 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIII 

518 ySerLeuserLeuvalAspProSerGlyAsriValTyrGluAspValSex'T 53 4 

1701 GGAATAACCCTCAAGTCTTTTCTTGTCTCACTCTTACTGCTGACGACCCC 1750 

lllllllllllllllllllllllilMIIIIIIIIIIIIIIIIIIIIIII 
535 rpAsnAsnProGlnValPheSerCysIjeuThrLeuThrAlaAspAspPro 550 

1751 GCGAATATTCACATCACAGACTTAGCTGCTGATCCCCTAGAAAAAAATCC 1800 

IIIIIMMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

551 AlaAsnlleHisIleThrAspLeuAlaAlaAspProLeuGluLysAsnPr 567 
1801 TATCCATTGGGGATACCAAGGGAATTGGGCATTATCTTGGCAAGAGGATA 1850 

MIIMIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIMIIIMIIIII 

568 oIleHisTrpGlyTyrGlnGlyAsnTrpAlaLeuSerTrpGlnGlxiAspT 584 
1851 CTGCGACTAAATCCAAAGCAGCGACTCTTACCTGGACAAAAACAGGATAC 1900 

IIIIIIIIIIIIIIIIIIIIIMMIIIIIIIMIIIIIIIMIIIIIM 

585 hrAlaThrLysSerLysAlaAlaThrLeuThrTrpThrLysThrGlyTyr 600 
1901 AATCCGAATCCTGAGCGTCGTGGAACCTTAGTTGCTAACACGCTATGGGG 1950 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMM 

601 AsnProAsnProGluArgArgGlyThrLeuValAlaAsnThrLeuTrpGl 617 

1951 ATCCTTTGTTGATGTGCGCTCCATACAACAGCTTGTAGCCACTAAAGTAC 2 000 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 
618 ySerPheValAspValAroSerTI eGTnGlnLeuValAlaThrLysValA 634 

2 001 GCC AATCTC AAGAAACTCGCGGC ATCTGGTGTGAAGGGATCTCGAACTTC 2050 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

63b rgGlnSerGlnGluThrArgGlylleTrpCysGluGlylleSerAsnPhe 650 

2 051 TTCCATAAAGATAGCACGAAGATAAATAAAGGTTTTCGCCACATAAGTGC 2100 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIII 
651 PheHisLysAspSerThrLysIleAsnLysGlyPheArgHisIleSerAl 667 

2101 AGGTTATGTTGTAGGAGCGACTAC AAC ATTAGCTTCTGATAATCTTATC A 2150 

llllllllllllllllllllllllllllllllllllllllllllllllll 
668 aGlyTyrValValGlyAlaThrThrThrLeuAlaSerAspAsnLeuIleT 684 

2151 CTGCAGCCTTCTGCCAATTATTCGGGAAAGATAGAGATCACTTTATAAAT 2200 

MIMIIIIIIIIIIIIIIIIIMIIMIIMIIIIIIIIIMMMIM 

685 hrAlaAlaPheCysGlnLeuPheGlyLysAspArgAspHisPhelleAsn 700 
2201 AAAAATAGAGCTTCTGCCTATGCAGCTTCTCTCCATCTCCAGCATCTAGC 2250 



IMIIMIIIIIIMIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

701 LysAsnArgAlaSerAlaTyrAlaAlaSerLeuHisLeuGlnHisLeuAl 717 
2251 GACCTTGTCTTCTCCAAGCTTGTTACGCTACCTTCCTGGATCTGAAAGTG 23 00 

IIIIIIIIMMMMMIIMIMIIIIMIIIIMMIIIIMIIIII 

718 aThrLeuSerSerProSerLeuLexiArgTyrLeuProGlySerGluSerG 734 
23 01 AGCAGCCTGTCCTCTTTGATGCTC AGATC AGCT AT ATCTATAGTAAAAAT 2 3 50 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIM 

73 5 luGlnProValLeuPheAspAlaGlnlleSerTyrlleTyrSerLysAsn 750 
23 51 ACTATGAAAACCTATi'AC ACCC AAGC ACCAAAGGGAGAGAGCTCGTGGTA 2400 

MIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIMIIillllll 

751 ThrMetLysThrTyrTyrThrGlnAlaProLysGlyGluSerSerTrpTy 767 
2401 TAATGACGGTTGCGCTCTGGAACTTGCGAGCTCCCTACCACAC ACTGCTT 2450 

IMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIMIIIIIII 

768 rAsnAspGlyCysAlaLeuGluLeuAlaSerSerLeuProHisThrAlaL 784 
2451 TAAGCCATGAGGGTCTCTTCCACGCGTATTTTCCTTTC ATCAAAGTAGAA 2500 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIII 

785 eu53erHisGluGlyI.eiiPheHi sAlaTyrPheProPhelleLysValGlu 800 
2501 GCTTCGTACATACACCAAGATAGCTTCAAAGAACGTAATACTACCTTGGT 2 550 

llllllllllllllllllllllllllllllllllllllllllllllllll 

801 AlaSerTyrlleHisGlnAspSerPheLysGluArgAsnThrThrLeuVa 817 
2 551 ACGATCTTTCGATAGCGGTGATTTAATTAACGTCTCTGTGCCTATTGGAA 2600 

MIIMIIIMIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIMIII 

818 lArgSerPheAspSerGlyAspLeuIleAsnValSerValProIleGlyl 834 
2601 TTACCTTCGAGAGATTCTCGAGAAACGAGCGTGCGTCTTACGAAGCTACT 2 650 

IMIIIIIIIIIIIIIMIMIIIIIIIIMIIIIIIMMIIIIIIIII 

83 5 leThrPheGluArgPheSerArgAsnGluArgAlaSerTyrGluAlaThr 850 
2 6 51 GTC ATCTACGTTGCCGATGTCTATCGTAAGAATCCTGACTGCACGAC AGC 27 00 

IIIIMIIMIIMIIIIMMMIIIIIMIMIIMIIIIIIIIIIII 

851 VallleTyrValAlaAspValTyrArgLysAsnProAspCysThrThrAl 867 
2701 TCTCCTAATCAACAATACCTCGTGGAAAACTACAGGAACGAATCTCTCAA 2750 

llllllllllllilllllllllllllMIIIIIIIIIIIIIIIIIIIIII 

868 aLeuLeuIleAsnAsnThrSerTrpLysThrThrGlyThrAsnLeuSerA 884 
2751 GACAAGCTGGTATCGGAAGAGCAGGGATCTTTTATGCCTTCTCTCCAAAT 2 800 

IIMIIIIIIillllMMIIIIIIIIIIIIIIIIMMMIIIIIIIM 

885 rgGlnAlaGlylleGlyArgAlaGlyllePheTyrAlaPheSerProAsn 900 
2801 CTTGAGGTCACAAGTAACCTATCTATGGAAATTCGTGGATCTTCACGCAG 2850 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIII 

901 LeuGluValThrSerAsnLeuSerMetGluIleArgGlySerSerArgSe 917 
2851 CTACAATGCAGATCTTGGAGGTAAGTTCCAGTTC 2884 

IIMIIIIIIIMIIIIIIIMMIIIIIIMII 

918 rTyrAsnAlaAspLeuGlyGlyLysPheGlnPhe 92 8 



RESTRICTION MAP 



Hinf I 
Tfil Msel 



Bfal 
Spell 



Tsp509l 
Msel I 
Mnll I 



BsmAI Mnll I I CviRI 

I I M I I II I 
TGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGCAGTA 
1 + + + + + + 60 

ACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTCAT 



MboII 

Msel CviJI Mnll Mnll 

Tsp509I HphI I Sfcl I Nlallll Earl j 

III II II II 

TGTGGTGAAATAATTTGTTAAACCACCTATAGCCCTCTACATGAAATCCTCTCTTCATTG 

ACACCACTTTATTAAACAATTTGGTGGATATCGGGAGATGTACTTTAGGAGAGAAGTAAC 



120 



Bbvl 
Apol I 
Tsp509l I Fnu4HI 
Maelll I I Mwol | 

Msel Tsp45l I I Tselj 

I I I I II 

GTTTTTAATCTCGTCATCTTTAGCACTTCCCTTGTCACTAAATTTCTCTGCGTTTGCTGC 

CAAAAATTAGAGCAGTAGAAATCGTGAAGGGAACAGTGATTTAAAGAGACGCAAACGACG 

NlalV 
BseRI 
Bpml 
ScrFI 
EcoRII 

Avail Avail 
ECOO109I BspGI 
PspSlI Sau96l 
Sau96l Hpyl78lll| BsaXI 

Sse8647l Alul | Alol 

Bfal I CviJI I Ppil 

II II 
TGTTGTTGAAATCAATCTAGGACCTACCAATAGCTTCTCTGGACCAGGAACCTACACTCC 

181 + + + + + + 240 

ACAACAACTTTAGTTAGATCCTGGATGGTTATCGAAGAGACCTGGTCCTTGGATGTGAGG 



241 



Bed 

Mnll Tthlllllj 
CviJI I CviRI I I HphI BsmAI 

II I II II 

TCCAGCCCAAACAACAAATGCAGATGGAACTATCTATAATCTAACAGGGGATGTCTCAAT 

+ 1- 1- h ( h 

AGGTCGGGTTTGTTGTTTACGTCTACCTTGATAGATATTAGATTGTCCCCTACAGAGTTA 

Alul 
CviJI 
Alwl I 
HpylBSiX I I 

Dpnl I I I Bsri 



300 



Fokl 



BstYI 
Sau3AI 
CviRI I 
I I 



I I I I 
I I I I 



I I 



I I 



Acelll 
Acil I 

Mwol I j 

II 



Dral 
Msel I 
Mmel I I 
II 



Hinfl 
Tfil 
Bmrl I 
I I 



301 



CACCAATGCAGGATCTCCGACAGCTCTAACCGCTTCCTGCT'ri'AAAGAAACTACTGGGAA 
+ + ^ ^ ^ ^ 

GTGGTTACGTCCTAGAGGCTGTCGAGATTGGCGAAGGACGAAATTTCTTTGATGACCCTT 



360 



BsaJI 
BstDSI 
CviJI 
Hael 
Haelll 
BsaJI I 

I I 



Clal 
Taqi 
Faul 
Sthl32l| 

Tsp509I SfaNI I I 

ICviJT I Bcefl- I III AciX 

II III! 

TCTTTCTTTCCAAGGCCACGGCTACCAATTTCTCCTACAAAATATCGATGCGGGAGCGAA 

361 + + + + + + 420 

AGAAAGAAAGGTTCCGGTGCCGATGGTTAAAGAGGATGTTTTATAGCTACGCCCTCGCTT 

Alul 
CviJT 
Hindlll 
CviRI 
Fnu4HI 



Bbvl 
RsaT I 
Taal I I 
I I I 



Alul I 
CviJI I 
MspAlI j 
PvuXI I 
Tsel I 
II 



Hinfl 
Tfil 
Hpyl78lII I 
I I 



Maelll 
Tsp45l 



CTGTACCTTTACCAATACAGCTGCAAATAAGCTTCTCTCCTTTTCAGGATTCTCCTATTT 
421 + + + + + + 

GACATGGAAATGGTTATGTCGACGTTTATTCGAAGAGAGGAAAAGTCCTAAGAGGATAAA 



480 



Bed 

CviJI I Alul 
BsmI NlaIV| j CviJI 

I III I 

GTCACTAATACAAACCACGAATGCTACCACAGGAACAGGAGCCATCAAGTCCACAGGAGC 

481 + + + + + + 540 

C AGTGATTATGTTTGGTGCTTAC GATGGTGTCCTTGTCCTC GGTAGTTC AGGTGTC CTCG 

CviJI 
Hael 
Haelll 
Msci 
Eael I 
Mwol I I 

TaqI Sfcl CjePI | | | Mnll 

II I II I I 

TTGTTCTATTCAGTCGAACTATAGTTGCTACTTTGGCCAAAACTTTTCTAATGACAATGG 

541 + + + + + + 600 

AACAAGATAAGTCAGCTTGATATCAACGATGAAACCGGTTTTGAAAAGATTACTGTTACC 



BsaJI 
Haell I 



Hhal 
Hin4l 
NlalV 
BsaHI 
Nan 
BanI I 
CjePlj 



Alul 
CviJI 
Mnll I 
Fnu4HI I I 
Styl Tsel| | | 
I MM 



Acelll 
Bbvl I 
BplI I I 
I M 



CjePI 



Acll 
Maell 



601 



AGGCGCCCTCCAAGGCAGCTCTATCAGTCTATCGCTAAACCCCAACCTAACGTTTGCCAA 



-+ 660 



TCCGCGGGAGGTTCCGTCGAGATAGTCAGATAGCGATTTGGGGTTGGATTGCAAACGGTT 



Sthl32l 
Bspl286l 
Bmgl 
BseSI 
NlaIV| 
BanI I I 
I I 



BslI 
BscGI 
MnlX 
BsaJI I 
BstDSI I 
Mnll I I 
I I 



Msel 
Tsp509l I 
I I 



AAACAAAGCAACGCAAAAAGGGGGTGCCCTCTATTCCACGGGAGGGATTACAATTAACAA 

661 + + + + + + 720 

TTTGTTTCGTTGCGTTTTTCCCCCACGGGAGATAAGGTGCCCTCCCTAATGTTAATTGTT 



Fnu4HI 
SacII 
Taul 
Acil I 
MspAlI I 

Thai I Cviai 
BseMII Acil | | NlaIV| 

Hpyl88IX BsaJI | | Ecil | | 

SfaNit BstDSI I I Acil I | j 

M I M M M 

TACGTTAAAC TC AGC ATC ATTTTCTGAAAAT ACCGC GGC GAAC AATGGCGGAGCC ATTTA 

721 + + + + + + 780 

ATGCAATTTGAGTCGTAGTAAAAGACTTTTATGGCGCCGCTTGTTACCGCCTCGGT7VAAT 

Cac8I 
Bfal 
Alul I 

CviJI j I Fnu4HI Bbvl Alul Maelll 

Nheljj Tsel| Tsp509l CviJI Tsp45l 

Ml Mil I 



Msel 
Maell I Ddel 
I I I 



781 



CACGGAAGCTAGCAGTTTTATTAGCAGCAACAAAGCAATTAGCTTTATAAACAATAGTGT 
+ + + 4- + 4. 

GTGCCTTCGATCGTCAAAATAATCGTCGTTGTTTCGTTAATCGAAATATTTGTTATCACA 



CviJI 
NlalV 
BseMII 
Mnll 
Cjel 



840 



Sfcl 
Alul I 
CviJI I 
Mwol I I 
BbvCI ! i I 



BpulOll II II III Taal Rsal Cjel 

Acil Ddel j II II ill Sf cl I Tati | CviJI | BsrI 

I II II II III II I I II I 
GACCGCAACCTCAGCTACAGGGGGAGCCATTTACTGTAGTAGTACATCAGCCCCCAAACC 
841 + + + + + + 900 

CTGGCGTTGGAGTCGATGTCCCCCTCGGTAAATGACATCATCATGTAGTCGGGGGTTTGG 



Hpyl88lX Tsp509I Bfal 

Msel Sthl32l j BscGI Cjel j Spel | 

I I I I I I II 

AGTCTTAACTCTATCAGACAACGGGGAACTGAACTTTATAGGAAATACAGCAATTACTAG 

901 + + + + + + 960 

TCAGAATTGAGATAGTCTGTTGCCCCTTGACTTGAAATATCCTTTATGTCGTTAATGATC 



Avail 
ECOO109I 
Psp5II 
Sau96I 

Sse8647l Bpml 
Mboll Hpyl78III I Dral | 

Cjel Bfal I Mnll | j Msel | | 

III III III 

TGGTGGGGCGATTTATACTGACAATCTAGTTCTTTCTTCTGGAGGACCTACGCTTTTTAA 

961 + + + + + + 1020 

ACCACCCCGCTAAATATGACTGTTAGATCAAGAAAGAAGACCTCCTGGATGCGAAAAATT 



Muni 
Tsp509l 
Acelll 
Bsu36I 
Ddel 
Mnll 



CviJI 



Alul 
CviJI 
PstI 
BsaXI 
Fnu4HI 
CviRI 
Tsel 
Sfcl I 

I I 



Bbvl 
BslI 
I 



HaelV 
BseRI Hin4l 
Plel|HinfI I 

II II 



AAACAACTCTGGCTATGATACTGCAGCTCCCTTAGGAGGAGCAATTGCGATTGCTGACTC 

1021 + + + + + + 1080 

TTTGTTGAGACCGATACTATGACGTCGAGGGAATCCTCCTCGTTAACGCTAACGACTGAG 



Dpnl 

BstYI I BsmAI 

Sau3AI I Alwl CviJI | 

Hpyl78IIl| I Hinfl Plel | | Taal 

II I I I I I I 

TGGATCTTTGAGTCTTTCGGCTCTTGGTGGAGACATCACTTTTGAAGGAAACACAGTAGT 

1081 + + + + + + 1140 

ACCTAGAAACTCAGAAAGCCGAGAACCACCTCTGTAGTGAAAACTTCCTTTGTGTCATCA 



Alul Hpyl88lX 
CviJI Hinfl | Apol Hpyl88IX 

MboII I TaqI | | Plel T3p509l Msel | 



I I I I i I I I I 

CAAAGGAGCTTCTTCGAGTCAGACCACTACCAGAAATTCTATTAACATCGGAAACACCAA 
1141 + ^ + ^ ^ ^ 3^200 

GTTTCCTCGAAGAAGCTCAGTCTGGTGATGGTCTTTAAGATAATTGTAGCCTTTGTGGTT 



CviJI 
CacSI 
Fnu4Hl 
Alul| 
CviJI I 
MspAlI I 
Pvuli j 
Bce83I I I 

Rsal II [I Dpnl 
Bbvl BsrGI I II II Sau3AI | 

Ddel TatI iTselj | |SmlI Mnll Alwl | 

I I I II I I I I III 
TGCTAAGATTGTACAGCTGCGAGCCTCTCAAGGCAATACTATCTACTTCTATGATCCTAT 
1201 + + + + + + 1260 

ACGATTCTAACATGTCGACGCTCGGAGAGTTCCGTTATGATAGATGAAGATACTAGGATA 



Acelll 
Bbvl 
Hpyl88lX 
Alul 



Bfal 



CviJI 
PstI 
SfaNI 
TspRI 
Fnu4HI 
SfaNI 
CviRI I 
Tsei I 
Btsi I I 
Sfcl I I 

I M 



Ddel 



Hpyl7 8III 
Msel Avail I 
BseMII |Sau96l | CviRI 
I I I 



AACAACTAGCATCACTGCAGCTCTCTCAGATGCTCTAAACTTAAATGGTCCTGACCTTGC 
1261 + + + + + + 1320 

TTGTTGATCGTAGTGACGTCGAGAGAGTCTACGAGATTTGAATTTACCAGGACTGGAACG 



Hinfl 
Tfil CviRI 



Hpyl88lX 
Bbvl 
Alul 

Bed CviJI 
Drdll I Hin4l| 
Nlaivl I Hpyl78III Hin4I j | 



1321 



Bpml 

AGGGAATCCTGCATATCAAGGAACCATCGTATTTTCTGGAGAGAAGCTCTCGGAAGCAGA 
+ + h + + + 

TCCCTTAGGACGTATAQTTCCTTGGTAGCATAAAAGACCTCTCTTCGAGAGCCTTCGTCT 



1380 



Alul 
AlwNI 
CviJX 
Mwol I 
PstI I I 
CviRI I I I 



BplI 
Fnu4HI 
Sfcl 
Alul 
AlwNl 
BplI 
BstAPI 

CviJT III] I Acil 
Mwol I i I I I Faul Mnll | 

Tsel II II I Tsp509l Sthl32l |MnlI I | 

I II II I I II II I 

AGCTGCAGAAGCTGATAATCTCAAATCTACAATTCAGCAACCTCTAACTCTTGCGGGAGG 

1381 + + + + + + 1440 

TCGACGTCTTCGACTATTAGAGTTTAGATGTTAAGTCGTTGGAGATTGAGAACGCCCTCC 



Hin4I 
I 



Maelll 
Tsp45l 
Hinfl I 
BplI I I 
Hpyl78lll| I I 
Msel II I I 

MM 



Bfal 

Plel Ddel Sthl32l 

I I I 

GCAACTCTCTCTTAAATCAGGAGTCACTCTAGTTGCTAAGTCCTTTTCGCAATCTCCGGG 

1441 + + + + + + 1500 

CGTTGAGAGAGAATTTAGTCCTCAGTGAGATCAACGATTCAGGAAAAGCGTTAGAGGCCC 



CviJI 
BseRI 
Neil I 
ScrFI I 
Mspl I I 
II I 



Fokl 
Cjel 
NlalV 
Avail 



Sau96l 
CviRI 
Mnll 

SfaNI Mnll | 

Banll I Nlalll | | 
Bspl286l I BslI I I I 

II I II I 



Alwl 
BsaBi 
Dpnl 

MspAlI Sau3AI | 
Acil I Bed I I 
BsmFI I |Bsll| I I 

I II I I 



CTCTACCCTCCTCATGGATGCAGGGACCACATTAGAAACCGCTGATGGGATCACTATCAA 

1501 + + + + + + 1560 

GAGATGGGAGGAGTACCTACGTCCCTGGTGTAATCTTTGGCGACTACCCTAGTGATAGTT 



Bspl286I 

Bsal CacSl 
Hinfl BsmAI Bmgl | 

Cjel Tfil Msel BseSI | Bsbl 

I II II I 

TAATCTTGTTCTCAATGTAGATTCCTTAAAAGAGACCAAGAAGGGCACGCTAAAAGCAAC 

1561 + + + + + + 1620 

ATTAGAACAAGAGTTACATCTAAGGAATTTTCTCTGGTTCTTCCCGTGCGATTTTCGTTG 



Tthlllll 
Maelll 
Taal 
Tsp45l 
Hpyl88lX I 
Mwol Cjel I I 



Hpyl78lII 
Dpnl 
BstYI 



Sau3AI 

Dpnl Cjel I 

Sau3AI I Alwl I I 

Hpyl78lII I I Alwl | | | 



I II I I II I I I I II I 

ACAAGCAAGTCAGACAGTCACTTTATCTGGATCGCTCTCTCTTGTAGATCCTTCTGGAAA 

1621 + + + + + + 1680 

TGTTCGTTCAGTCTGTCAGTGAAATAGACCTAGCGAGAGAGAACATCTAGGAAGACCTTT 



Pf 111081 
Acci I 
CjePlI I 
II 



MboII 
BsmAI I 
Bce83I I I 

I II 



CjePI 
Sxnll |MnlI 

I I I 



BsmAI 



TGTCTACGAAGATGTCTCTTGGAATAACCCTCAAGTCTTTTCTTGTCTCACTCTTACTGC 

1681 -H + + + + + 1740 

ACAGATGCTTCTACAGAGAACCTTATTGGGAGTTCAGAAAAGAACAGAGTGAGAATGACG 



Faul 
Sspl 
Sthl32lt 
Thai I [ 
Siml Acil I I [ 
I I I II 



Dpnl 
Sau3AI 
Fnu4HI 
Alul 



Alwl I 
Cvi JI I 
Tsel I 

Bbvl Ddel | | | | Bfal 

I I I 

TGACGACCCCGCGAATATTCACATCACAGACTTAGCTGCTGATCCCCTAGAAAAAAATCC 

1741 + + + + + + 1800 

ACTGCTGGGGCGCTTATAAGTGTAGTGTCTGAATCGACGACTAGGGGATCTTTTTTTAGG 



Tsp509l 
BsaJI j 
Styl' t 



BciVI 
Mnll I 



BciVI 
I 

TATCCATTGGGGATACCAAGGGAATTGGGCATTATCTTGGCAAGAGGATACTGCGACTAA 

1801 + + + + + + 1860 

ATAGGTAACCCCTATGGTTCCCTTAACCCGTAATAGAACCGTTCTCCTATGACGCTGATT 



Hinf I 
Fnu4HI I 
Tsel| I 
Plel I I I 
III I 



BspGI 
ScrFI I 
Bbvl I I 
EcoRII I I BciVI 
III I 



BpulOI 
Ddel 
Hpyl78III 
Hinf I 
Tf il 
Hgal I 
Hpyl88lX 1 
BseMII I I 
I I I 



ATCCAAAGCAGCGACTCTTACCTGGACAAAAACAGGATACAATCCGAATCCTGAGCGTCG 

1861 + + + + + + 1920 

TAGGTTTCGTCGCTGAGAATGGACCTGTTTTTGTCCTATGTTAGGCTTAGGACTCGCAGC 



Cjel 
Ddel I 
NlalV I I 



Dpnl 
NlalV 
BomHI 
BStYI 
SauBAI 
Alwl I 
I I 



Hhal 
Hin4I I 
Alwl Cjel I I 
I III 



TGGAACCTTAGTTGCTAACACGCTATGGGGATCCTTTGTTGATGTGCGCTCCATACAACA 

1921 + + + + + + 1980 

AC CTTGGAATC AACGATTGTGCGATAC CCCT AGGAAAC AACTAC AC GC GAGGTATGTTGT 



Alul 
CviJI 



Bce83I 
CviJI I 



Rsal 



I 



Fnu4HI Dpnl 

Taul BstYI j 

Hpyl78lII Acil| Sau3AI j 

Smll I Thaij SfaNi | | 

II II III 



GCTTGTAGCCACTAAAGTACGCCAATCTCAAGAAACTCGCGGCATCTGGTGTGAAGGGAT 

1981 + + + + + + 2040 

CGAACATCGGTGATTTCATGCGGTTAGAGTTCTTTGAGCGCCGTAGACCACACTTCCCTA 



XmnI 
Alwl 
Hpyl78lII I 
TaqI I 
Mbolll I 



MslI 

MboIT BspMI I CviRI 

I II 



CTCGAACTTCTTCCATAAAGATAGCACGAAGATAAATAAAGGTTTTCGCCACATAAGTGC 

2041 + + + + + + 2100 

GAGCTTGAAGAAGGTATTTCTATCGTGCTTCTATTTATTTCCAAAAGCGGTGTATTCACG 



CviJI 
PstI I 
TspRI I 
Fnu4HI I 
CviRI I 

Hpyl88IX Tsel j 

Alul I BtsI I I 

Bsgl CviJI I Sf cl I I 

I II I II 

AGGTTATGTTGTAGGAGCGACTACAACATTAGCTTCTGATAATCTTATCACTGCAGCCTT 

2101 + + + + + + 2160 

TCCAATACAACATCCTCGCTGATGTTGTAATCGAAGACTATTAGAATAGTGACGTCGGAA 



Bbvl Dpnl 
Tsp509l Sau3AI j Alul 

Sthl32I I Hpyl78lII BsaBI | | CviJI 

. II I - M I I 

CTGCCAATTATTCGGGAAAGATAGAGATCACTTTATAAATAAAAATAGAGCTTCTGCCTA 

2161 + + + + + + 2220 

GACGGTTAATAAGCCCTTTCTATCTCTAGTGAAATATTTATTTTTATCTCGAAGACGGAT 



Alul 
CviJI 
Fnu4HI 
Bpml I 
CviRI I 
Mwol I 
Tsel I 
II 



Tthllll 
SfaNI 
Bbsl I 

Bbvl Mbollj 
Bed Bf al I I 

I III 
TGCAGCTTCTCTCCATCTCCAGCATCTAGCGACCTTGTCTTCTCCAAGCTTGTTACGCTA 

2221 + + + + + + 2280 

ACGTCGAAGAGAGGTAGAGGTCGTAGATCGCTGGAACAGAAGAGGTTCGAACAATGCGAT 



Mwol 
Maelll 
Alul 
CviJI 
Hindlll I 
I I 



Hpyl88lX 



Sfcl 
BseMII I 
Alul I I 
CviJI I I 



Dpnl 
BstYI 

Sau3AI 
ScrFI I 
EcoRII I I 
I I I 



Alwl 



SfaNI 
CviJI I 
Fnu4HI I I 
Tsel I I I 

I I 



Bbvl 



I 



Dpnl 
Sau3AI 
Hpyl88lX| 
Ddel I I 
Miili I I I 

II II 



CCTTCCTGGATCTGAAAGTGAGCAGCCTGTCCTCTTTGATGCTCAGATCAGCTATATCTA 

2281 + + + + 4- + 2340 

GGAAGGACCTAGACTTTCACTCGTCGGACAGGAGAAACTACGAGTCTAGTCGATATAGAT 



Ban 1 1 
BS iHKAI 
Bspl286l 
Sad 
BssST I 
Alul I 
CviJI I 
Tthlllll I I 
CjePI Taqll | | j 

I I I II 

TAGTAAAAATACTATGAAAACCTATTACACCCAAGCACCAAAGGGAGAGAGCTCGTGGTA 
2341 + + + + + + 2A00 

ATCATTTTTATGATACTTTTGGATAATGTGGGTTCGTGGTTTCCCTCTCTCGAGCACCAT 



Hpyl78lII 
Hhal I 
CjePI I I 
Taal I I I 



Banll 

BS iHKAI 
Bspl286l 
Sad 
Alul I 
CviJI j 
Cac8l I I 



Nlalll 
Mboll 
CviJI 
Mnll 



Msel I 
TspRI I I 
BtSi I I I 

I III. .. 

TAATGACGGTTGCGCTCTGGAACITGCGAGC'l'CCCTACCACACACTGCTTTAAGCCATGA 

2401 + + + + + + 2460 

ATTACTGCCAACGCGAGACCTTGAACGCTCGAGGGATGGTGTGTGACGAAATTCGGTACT 

Thai 
Afllll 
Mlul 

Earl I I Alul 
Bsal I j CviJI 
Simi BsniAI | | Hindlll I Rsal 

I I I I III 

GGGTCTCTTCCACGCGTATTTTCCTTTCATCAAAGTAGAAGCTTCGTACATACACCAAGA 

2461 + + + + + 2520 

CCCAGAGAAGGTGCGCATAAAAGGAAAGTAGTTTCATCTTCGAAGCATGTATGTGGTTCT 



Dpnl 

Alul BsaJI Sau3AI | 

CviJI Maell Styl Rsal | | TaqI Acil 

I I I I II I I 



Maell 
HphI 
Msel 
Pad 
Tsp509l I 
Msel I I 
II I 



TAGCTTCAAAGAACGTAATACTACCTTGGTACGATCTTTCGATAGCGGTGATTTAATTAA 

2521 + + + + + + 2580 

ATCGAAGTTTCTTGCATTATGATGGAACCATGCTAGAAAGCTATCGCCACTAT^TT/^TT 



Hgal 
Hpyl78III 
Hpyl78III 
Taqi 
Aval I 
Smll I 
Hin^T I I 

Hinf I I I I I BplI 
BsmAI Hpyl78lII |Xhol| | | Cac8l | 

BsmBI Tsp509l TaqI |TfiI I | | I Hin4l | 

I I I I I I I I I II 

CGTCTCTGTGCCTATTGGAATTACCTTCGAGAGATTCTCGAGAAACGAGCGTGCGTCTTA 

2581 + + + + + + 2640 

GCAGAGACACGGATAACCTTAATGGAAGCTCTCTAAGAGCTCTTTGCTCGCACGCAGAAT 



Alul 
CviJI Taal 



Maell 



Hpyl78III 

Bsgl Hinfl I Alul 

BsaBI I Tfil | CviRI CviJI 

I I I I I I 



CGAAGCTACTGTCATCTACGTTGCCGATGTCTATCGTAAGAATCCTGACTGCACGACAGC 

2641 + + + + + + 2700 

GCTTCGATGACAGTAGATGCAACGGCTACAGATAGCATTCTTAGGACTGACGTGCTGTCG 



Bce83I Hpyl78lII 
Sfcl| Hinfl I Alul 

Acelll BssSI Mnll | j Tfil Smll j CviJI 

I I III I I I I 

TCTCCTAATCAACAATACCTCGTGGAAAACTACAGGAACGAATCTCTCAAGACAAGCTGG 

2701 + + + + + + 2760 

AGAGGATTAGTTGTTATGGAGCACCTTTTGATGTCCTTGCTTAGAGAGTTCTGTTCGACC 



HpylSSIX 
Earl I 
Sapl I 
I I 



2761 



MboII Maelll 
Dpnl I Tsp4 5l 

BstYI I I Hpyl78III j 
Sau3AI I I Smll | | 

Hin4I j I I Alwl Mnll j j | Maelll 

II II I I II I I 

TATCGGAAGAGCAGGGATCTTTTATGCCTTCTCTCCAAATCTTGAGGTCACAAGTAACCT 

ATAGCCTTCTCGTCCCTAGAAAATACGGAAGAGAGGTTTAGAACTCCAGTGTTCATTGGA 



2820 



Dpnl 
BstYI 
Sau3AI 
MboII 
Apol 



Tsp509l. 
BslI I 
Bcc83I I I 
I I I 



Alul 
CviJI 
Fnu4HI I 
Tsel I I 
Alwl I I I 

I II I 



Dpnl 
Mnll 
Bglll 
BstYI 
Sau3AI 
Bbvlj 
CviRI I I 

III 



Ban 



ATCTATGGAAATTCGTGGATCTTCACGCAGCTACAATGCAGATCTTGGAGGTAAGTTCCA 

2821 + + + + + -f 2880 

TAGATACCTTTAAGCACCTAGAAGTGCGTCGATGTTACGTCTAGAACCTCCATTCAAGGT 



Ddel 
Hpyl78III 



Dpnl 
BstYI 
Sau3AI 
Dpnl BseMII 
Sau3AI I Apol I 

Hpyl78lll| I Tsp509I Alwl j 
Alwl 11 |DdeI I Ddelj | Bfal 

I II I I I II 
GTTCTAAAAGCGTTCCTGATCCCTTAGAAATTCTAAGAGATCCTGAGTGTATCTAGGGAC 



2881 



2940 



CAAGATTTTCGCAAGGACTAGGGAATCTTTAAGATTCTCTAGGACTCACATAGATCCCTG 



BsmFI 
I 

TTCTCAAAGA 

2941 + 2950 

AAGAGTTTCT 



Enzymes that do cut: 



ACCI 


Acelll 


Acil 


Acll 


Af IIII 


Alol 


Alul 


Alwl 


AlwNI 


ApoI 


Aval 


Avail 


BamHI 


Banl 


Banll 


BbeX 


Bbvl 


BbvCI 


Bed 


Bce83l 


Beef I 


BciVI 


Bfal 


Bglll 


Bmgl 


Bmrl 


BplI 


Bpml 


BpulOl 


Bsal 


BsaBi 


BsaHI 


BsaJI 


BsaXI 


Bsbl 


BscGI 


BseMII 


BseRI 


BseSI 


Bsgl 


BsiHKAI 


BslI 


Bsml 


BsmAI 


BsmBI 


BsmFI 


Bspl286I 


BspGI 


BspMI 


BsrI 


BsrGI 


BssSI 


BstAPI 


BstDSI 


BstYI 


BSU36I 


BtSi 


Cac8l 


Cjel 


CjePI 


Clal 


CvijI 


CviRI 


Ddel 


Dpnl 


Dral 


Drdll 


Eael 


Earl 


Ecil 


ECOO109I 


EcoRII 


Faul 


Fnu4HI 


Fokl 


Hael 


Haell 


Haelll 


HaelV 


Hgal 


Hhal 


Hin4I 


Hindlll 


Hinfl 


HphI 


Hpyl78lII 


Hpyl88lX 


Maell 


Maelll 


MboII 


Mlul 


Mmel 


Mnll 


Msd 


Msel 


MslI 


Mspl 


MspAlI 


Muni 


Mwol 


Narl 


Neil 


Nhel 


Nlalll 


NlalV 


Pad 


Pf 111081 


Plel 


Ppil 


PspSII 


PstI 


PvuII 


Rsal 


Sad 


Sadl 


Sapl 


Sau96l 


Sau3AI 


ScrFI 


SfaNI 


Sf cl 


Si ml 


Smll 


Spel 


Sse8647l 


Sspl 


Sthl32l 


Sty I 


Taal 


TaqI 


TaqI I 


TatI 


Taul 


Tf il 


Thai 


Tsel 


Tsp45I 


Tsp509I 


TspRI 


Tthllll 


Tthlllll 


Xhol 


Xmni 





Enzymes that do not cut : 



Aarl 


Aatll 


Aflll 


Ahdl 


Apal 


Apalil 


AscI 


Avrll 


Bael 


Bcgl 


Bell 


Bgll 


Bpull02l 


BeaAl 


BsaWI 


BsiEI 


Bsp24l 


BspEI 


BspLUllI 


BsrBI 


BsrDI 


BsrFI 


BssHII 


BstEII 


BstXI 


BstZl7l 


Dram 


DrdI 


EagI 


EC047III 


Eco57I 


EcoNI 


EcoRI 


EcoRV 


Fsel 


Fspl 


Gdill 


HgiEII 


Hindi 


Hpal 


Kpnl 


Ncol 


Ndel 


NgoAIV 


NotI 


Nrul 


Nsil 


Nspl 


NspV 


Pf IMI 


PinAI 


Pmel 


Pmll 


PshAI 


Pvul 


Real 


RleAI 


RsrII 


Sail 


SanDl 


Sbfl 


Seal 


SexAI 


Sf il 


Sgfl 


SgrAI 


Smal 


SnaBI 


SphI 


Srf I 


StuI 


Sun I 


Swal 


vspi 


Xbal 


Xcml 
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3. I am employed by Aventis Pasteur Limited, the assignee of the above-identified 
apphcation. Between 1 993 and 2003, 1 was responsible for the research project from 
which the above-identified patent application arose. 

4. The experiment described below was carried out at Aventis Pasteur. 

5. It has been previously demonstrated (Yang et al. Infect, Immun. May 1993. 
61(5):2037-40) that mice are susceptible to intranasal infection with different isolates of 
C pneumoniae. Strain AR-39 (Grayston et al (1990) Journal of Infectious Diseases 
161 :61 8-625) was used in Balb/c mice as a challenge infection model to examine the 
capacity of Chlamydia gene products delivered as naked DMA to elicit a protective 
response against a sublethal C. pneumoniae lung infection. Protective immunity is 
defined as an accelerated clearance of pulmonary infection. 

6. Groups of 7 to 9 week old male Balb/c mice (8 to 10 per group) were immunized 
intramuscularly (i.m.) plus intranasally (i.n.) with plasmid DNA containing a sequence 
encoding the C. pneumoniae 98 kDa protein designated CPNl 00638 (plasmid pCAI638). 
Saline or the plasmid vector lacking an inserted Chlamydial gene was given to groups of 
control animals. 

7. The sequence encoding the C. pneumoniae 98 kDa protein designated CPNl 00638 
in plasmid pCAI638 is the same as SEQ ID No:6 (encoding SEQ ID No: 14) of the above- 
identified patent application. Plasmid pCA1638 is based on plasmid pcDNA3.1(-)Myc- 
His C (Invitrogen), in which SEQ ID No:6 under control of the cytomegalovirus (CMV) 
promoter, was inserted. 

8. For i.m. immunization, alternate left and right quadriceps of mice were injected 
with 100|ig of DNA in 50|al of PBS on three occasions at 0, 3 and 6 weeks. For i.n. 
immunization, anaesthetized mice were aspirated 50fa] of PBS containing 50 \ig DNA on 
three occasions at 0, 3 and 6 weeks. At week 8, immunized mice were inoculated i.n. 



with 5x10^ IFU of C. pneumoniae, strain AR39 in 100|al of SPG buffer to test their 
ability to limit the growth of a sublethal C. pneumoniae challenge. 

9, Lungs were taken from mice at day 9 post-challenge and inmiediately homogenised 
in SPG buffer (7.5% sucrose, 5mM glutamate, 12.5mM phosphate pH7.5). The 
homogenate was stored frozen at -70°C until assay. Dilutions of the homogenate were 
assayed for the presence of infectious Chlamydia by inoculation onto monolayers of 
susceptible cells. The inoculum was centrifiiged onto the cells at 3000rpm for 1 hour, 
then the cells were incubated for three days at 35°C in the presence of 1 |J.g/ml 
cycloheximide. After incubation the monolayers were fixed with formalin and methanol 
then immunoperoxidase stained for the presence of Chlamydial inclusions using 
convalescent sera from rabbits infected with C. pneumoniae and metal-enhanced DAB as 
a peroxidase substrate. 

10. The attached Figure shows that mice immunized i.n. and i.m. with pCAI638 had 
Chlamydial lung titers less than 60,000 in 4 of 6 cases at day 9 (mean 85,483), The range 
of values for control mice sham immunized with saline was 85,000-250,000 IFU/lung 
(mean 166,288) at day 9. DNA immunisation per se was not responsible for the 
observed protective effect since another plasmid DNA construct, pCACPNM1061 , failed 
to protect, with lung liters in immunised mice similar to those obtained for saline- 
immunized control mice. The construct pCACPNM1061 is identical to pC1638 except 
that the nucleotide sequence encoding the 98 kDa protein is replaced with a 

C pneumoniae nucleotide sequence encoding a Zinc Metalloprotease protein. 

1 1 , The above-described experiment illustrates that immunization of mice with an 
expression plasmid containing a nucleotide sequence encoding SEQ ID No: 14, protects 
the mice against an intranasal challenge of C pneumoniae. 

12. I hereby declare that all statements made herein of my knowledge are true and 
that all statements made on information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like 



are punishable by fine or imprisonment, or both, under section 1 001 of Title 1 8 of the 
United States Code, and that such willful false statements may jeopardize the validity of 
this application or any patent issuing thereon. 
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Figure 1 A: Nucleotide and deduced amino acid sequence of CPNl 00634 

gataaaaatt cttgacagct gttttgtcat ctttaacttg atttacttat tttgtttcta 60 

tattgatgcg aatagttctc taaaaaacaa aagcattacc atg aag act teg att 115 

Met Lys Thr Ser lie 
1 5 



cct tgg gtt tta gtt tec tee gtg tta get ttc tea tgt cac eta eag 
Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gin 
10 15 20 



163 



tea eta get aae gag gaa ett tta tea cet gat gat age ttt aat gga 
Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly 
25 30 35 



211 



aat ate gat tea gga acg ttt act eca aaa act tea gee aea aea tat 
Asn lie Asp Ser Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr 
40 45 50 



259 



tet eta aca gga gat gtc tte ttt tac gag cct gga aaa ggc act cec 
Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro 
55 60 65 



307 



tta tet gac agt tgt ttt aag eaa aee aeg gac aat ett ace tte ttg 
Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu 
70 75 80 85 



355 



ggg aae ggt cat age tta acg ttt ggc ttt ata gat get ggc act cat 
Gly Asn Gly His Ser Leu Thr Phe Gly Phe lie Asp Ala Gly Thr His 
90 95 100 



403 



gca ggt get get gea tet aea aca gea aat aag aat ett ace ttc tea 
Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser 
105 110 115 



451 



999 ttt tec tta etg agt ttt gat tee tet ect age aca acg gtt act 
Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr 
120 125 130 



499 



aea ggt eag gga acg ett tec tea gea gga ggc gta aat tta gaa aat 
Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn 
135 140 145 



547 



att cgt aaa ett gta gtt get ggg aat ttt tet act gca gat ggt gga 
lie Arg Lys Leu Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly 
150 155 160 165 



595 



get ate aaa gga gcg tet ttc ett tta act ggc act tet gga gat get 
Ala lie Lys Gly Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala 
170 175 180 



643 



ett ttt agt aae aae tet tea tea aca aag gga gga gca att get act 
Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly Gly Ala lie Ala Thr 
185 190 195 



691 
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1 B 

aca gca ggc get cgc ata gca aat aac aca ggt tat gtt aga ttc eta 739 

Thr Ala Gly Ala Arg lie Ala Asn Asn Thr Gly Tyr Val Arg Phe Leu 

200 205 210 

tct aac ata gcg tct acg tea gga ggc get ate gat gat gaa ggc aeg 787 

Ser Asn lie Ala Ser Thr Ser Gly Gly Ala lie Asp Asp Glu Gly Thr 

215 220 225 

teg ata eta teg aac aae aaa ttt eta tat ttt gaa ggg aat gca geg 835 

Ser lie Leu Ser Asn Asn Lys Phe Leu Tyr Phe Glu Gly Asn Ala Ala 

230 235 240 245 

aaa act act ggc ggt geg ate tgc aac acc aag gcg agt gga tct ect 883 

Lys Thr Thr Gly Gly Ala lie Cys Asn Thr Lys Ala Ser Gly Ser Pro 
250 255 260 

gaa ctg ata ate tet aac aat aag act ctg ate ttt get tea aac gta 931 

Glu Leu He He Ser Asn Asn Lys Thr Leu He Phe Ala Ser Asn Val 
265 270 275 

gca gaa aca age ggt ggc gee ate cat get aaa aag eta gee ett tec 97 9 

Ala Glu Thr Ser Gly Gly Ala He His Ala Lys Lys Leu Ala Leu Ser 
280 285 290 

tct gga ggc ttt aca gag ttt eta cga aat aat gte tea tea gca act 1027 

Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn Val Ser Ser Ala Thr 

295 300 305 

ect aag ggg ggt get ate age ate gat gee tea gga gag etc agt ett 1075 

Pro Lys Gly Gly Ala He Ser He Asp Ala Ser Gly Glu Leu Ser Leu 

310 315 320 325 

tet gca gag aca gga aac att acc ttt gta aga aat ace ett aca aca 1123 

Ser Ala Glu Thr Gly Asn He Thr Phe Val Arg Asn Thr Leu Thr Thr 
330 335 340 

ace gga agt acc gat act ect aaa cgt aat geg ate aae ata gga agt 1171 

Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala He Asn He Gly Ser 
345 350 355 

aae ggg aaa ttc acg gaa tta egg get get aaa aat eat aca att ttc 1219 

Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys Asn His Thr He Phe 
360 365 370 

ttc tat gat cee ate act tea gaa gga acc tea tea gac gta ttg aag 1267 

Phe Tyr Asp Pro He Thr Ser Glu Gly Thr Ser Ser Asp Val Leu Lys 

375 380 385 

ata aat aae ggc tct gcg gga get etc aat cca tat caa gga aeg att 1315 

He Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro Tyr Gin Gly Thr He 

390 395 400 405 



eta ttt tct gga gaa acc eta aca gca gat gaa ett aaa gtt get gae 
Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu Leu Lys Val Ala Asp 
410 415 420 



1363 
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aat tta aaa tct tea ttc acg cag cca gtc tec eta tec gga gga aag 
Asn Leu Lys Ser Ser Phe Thr Gin Pro Val Ser Leu Ser Gly Gly Lys 
425 430 435 



1411 



tta ttg eta caa aag gga gtc act tta gag age acg age ttc tct caa 
Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser Thr Ser Phe Ser Gin 
440 445 450 



1459 



gag gee ggt tct etc etc ggc atg gat tea gga acg aca tta tea act 
Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly Thr Thr Leu Ser Thr 
455 460 465 



1507 



aca get ggg agt att aca ate acg aac eta gga ate aat gtt gac tec 
Thr Ala Gly Ser lie Thr lie Thr Asn Leu Gly lie Asn Val Asp Ser 
470 475 480 485 



1555 



tta ggt ctt aag cag ccc gtc age eta aca gea aaa ggt get tea aat 
Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala Lys Gly Ala Ser Asn 
490 495 500 



1603 



aaa gtg ate gta tct ggg aag etc aac ctg att gat att gaa ggg aac 
Lys Val lie Val Ser Gly Lys Leu Asn Leu lie Asp lie Glu Gly Asn 
505 510 515 



1651 



att tat gaa agt cat atg ttc age cat gac cag etc ttc tet eta tta 
lie Tyr Glu Ser His Met Phe Ser His Asp Gin Leu Phe Ser Leu Leu 
520 525 530 



1699 



aaa ate acg gtt gat get gat gtt gat act aac gtt gac ate age age 
Lys lie Thr Val Asp Ala Asp Val Asp Thr Asn Val Asp lie Ser Ser 
535 540 545 



1747 



ctt ate cet gtt ect get gag gat cct aat tea gaa tac gga ttc caa 
Leu lie Pro Val Pro Ala Glu Asp Pro Asn Ser Glu Tyr Gly Phe Gin 
550 555 560 565 



1795 



gga caa tgg aat gtt aat tgg act acg gat aca get aca aat aca aaa 
Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr Ala Thr Asn Thr Lys 
570 575 580 



1843 



gag gee acg gea act tgg ace aaa aca gga ttt gtt ccc age ccc gaa 
Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe Val Pro Ser Pro Glu 
585 590 595 



1891 



aga aaa tct geg tta gta tgc aat ace eta tgg gga gtc ttt act gac 
Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp Gly Val Phe Thr Asp 
600 605 610 



1939 



att cgc tet ctg caa cag ett gta gag ate ggc gea act ggt atg gaa 
lie Arg Ser Leu Gin Gin Leu Val Glu lie Gly Ala Thr Gly Met Glu 
615 620 625 



1987 



cae aaa caa ggt ttc tgg gtt tec tec atg acg aac ttc ctg eat aag 
His Lys Gin Gly Phe Trp Val Ser Ser Met Thr Asn Phe Leu His Lys 
630 635 640 645 



2035 



MAR 0 8 2004 
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act gga gat gaa aat cgc aaa ggc ttc cgt cat acc tct gga ggc tac 
Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His Thr Ser Gly Gly Tyr 
650 655 660 



2083 



gtc ate ggt gga agt get cac act cct aaa gac gac eta ttt acc ttt 
Val lie Gly Gly Ser Ala His Thr Pro Lys Asp Asp Leu Phe Thr Phe 
665 670 675 



2131 



gcg ttc tgc cat etc ttt get aga gac aaa gat tgt ttt ate get cac 
Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp Cys Phe lie Ala His 
680 685 690 



2179 



aac aac tct aga acc tac ggt gga act tta ttc ttc aag cac tct cat 
Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe Phe Lys His Ser His 
695 700 705 



2227 



acc eta caa cec caa aac tat ttg aga tta gga aga gea aag ttt tct 
Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly Arg Ala Lys Phe Ser 
710 715 720 725 



2275 



gaa tea get ata gaa aaa ttc cct agg gaa att cec eta gee ttg gat 
Glu Ser Ala lie Glu Lys Phe Pro Arg Glu lie Pro Leu Ala Leu Asp 
730 735 740 



2323 



gtc caa gtt teg ttc age eat tea gac aac cgt atg gaa aeg cac tat 
Val Gin Val Ser Phe Ser His Ser Asp Asn Arg Met Glu Thr His Tyr 
745 750 755 



2371 



ace tea ttg cca gaa tec gaa ggt tct tgg age aac gag tgt ata get 
Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser Asn Glu Cys lie Ala 
760 765 770 



2419 



ggt ggt ate ggc eta gac ctt cct ttt gtt ett tec aac cca cat cct 
Gly Gly lie Gly Leu Asp Leu Pro Phe Val Leu Ser Asn Pro His Pro 
775 780 785 



2467 



ctt ttc aag ace ttc att cca cag atg aaa gtc gaa atg gtt tat gta 
Leu Phe Lys Thr Phe He Pro Gin Met Lys Val Glu Met Val Tyr Val 
790 795 800 805 



2515 



tea caa aat age ttc ttc gaa age tct agt gat ggc cgt ggt ttt agt 
Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp Gly Arg Gly Phe Ser 
810 815 820 



2563 



3^tt gga agg ctg ett aac etc teg att cct gtg ggt gcg aaa ttc gtg 
He Gly Arg Leu Leu Asn Leu Ser He Pro Val Gly Ala Lys Phe Val 
825 830 835 



2611 



cag ggg gat ate gga gat tec tac acc tat gat etc tea gga ttc ttt 
Gin Gly Asp He Gly Asp Ser Tyr Thr Tyr Asp Leu Ser Gly Phe Phe 
840 845 850 



2659 



gtt tec gat gtc tat cgt aac aat cec caa tct aea gcg act ett gtg 2707 
Val Ser Asp Val Tyr Arg Asn Asn Pro Gin Ser Thr Ala Thr Leu Val 
855 860 865 



0 8 . -» 
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atg age cca gac tct tgg aaa att cgc ggt ggc aat ctt tea aga cag 
Met Ser Pro Asp Ser Trp Lys lie Arg Gly Gly Asn Leu Ser Arg Gin 
870 875 880 885 



2755 



gca ttt tta ctg agg ggt age aae aae tac gte tac aac tec aat tgt 2803 
Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys 
890 895 900 

gag etc tte gga eat tae get atg gaa ete cgt gga tet tea agg aae 2 851 
Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn 
905 910 915 



tae aat gta gat gtt ggt ace aaa ete ega tte tagattgeta aaacteceta 2904 
Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
920 925 



gttcttetag ggagttttct cataetttta gggaaatatt tgetat 



2950 
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J 



Alul 

Hpyl7 8III CviJI 
Apol iMspAlI 
Tsp509I I PvuII Msel SfaNI 

III I I 

GATAAAAATTCTTGACAGCTGTTTTGTCATCTTTAACTTGATTTACTTATTTTGTTTCTA 

1 + + + + + + 60 

CTATTTTTAAGAACTGTCGACAAAACAGTAGAAATTGAACTAAATGAATAAAACAAAGAT 

BsaJI 
Styl 
MboII I 
Hinf I I I 
Tf il I I 
Nlalll Bbsl I I I 
MslI I TaqI | | | 

I I I I II 

TATTGATGCGAATAGTTCTCTAAAAAACAAAAGCATTACCATGAAGACTTCGATTCCTTG 

61 + + + + + + 120 

ATAACTACGCTTATCAAGAGATTTTTTGTTTTCGTAATGGTACTTCTGAAGCTAAGGAAC 

HphI Maelll Alul 

Alul I Nlalll Taal CviJI 

CviJI |MaeIIl| Tsp45I Mnll | 

Mnll] |Tsp45l| Sfcl I Bfal II HphI 

II I II I I I II I 

GGTTTTAGTTTCCTCCGTGTTAGCTTTCTCATGTCACCTACAGTCACTAGCTAACGAGGA 

121 + + + + + + 180 

CCAAAATCAAAGGAGGCACAATCGAAAGAGTACAGTGGATGTCAGTGATCGATTGCTCCT 

BsaXI 
Alol I 
Ppil I 
EC057I I I 
Acll I I I 
Maell I I I 
HpylVBIII I j 11 
Hinf I I I I I I 
Msel Tf il I I I I I 

Alul I Clal I j I I I I 

CviJI I TaqI | | | | | | 

II I I I II II 

ACTTTTATCACCTGATGATAGCTTTAATGGAAATATCGATTCAGGAACGTTTACTCCAAA 

181 + + + + + + 240 

TGAAAATAGTGGACTACTATCGAAATTACCTTTATAGCTAAGTCCTTGCAAATGAGGTTT 
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Figure 2 A: Restriction enzyme analysis of CPNl 00634 
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Fig. 2 B 



241 



Bbsl 

CviJI MboII 

I I 
AACTTCAGCCACAACATATTCTCTAACAGGAGATGTCTTCTTTTACGAGCCTGGAAAAGG 

+ + + + + + 

TTGAAGTCGGTGTTGTATAAGAGATTGTCCTCTACAGAAGAAAATGCTCGGACCTTTTCC 
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BslI . 
EcoNI 
ScrFI I 
CviJI I I 
EcoRII I I 
I I I 



300 



CjePI 
Msel I 



BsaJI 
BstDSI 



CjePI 
BslI I 



Hpyl8 8IX Taal 

I I II I II 

CACTCCCTTATCTGACAGTTGTTTTAAGCAAACCACGGACAATCTTACCTTCTTGGGGAA 

301 + + + + + + 

GTGAGGGAATAGACTGTCAACAAAATTCGTTTGGTGCCTGTTAGAATGGAAGAACCCCTT 



360 



CviRI 
Fnu4HI 
Tsel 
Fnu4HI 
AlwNI 



BstAPI 
Mwol 
Tsel 
Aarl 
MslI 
CviRI 



Acll 
Mae 1 1 
Msel I 
Alul I I CviJI 
CviJI I jsfaNI I 
III II 



Nlalll 
Bbvl 
Bbvl I 
BspMI I I 

Taal CviJI I ISfaNI | Cac8l| j | 

II I I 

CGGTCATAGCTTAACGTTTGGCTTTATAGATGCTGGCACTCATGCAGGTGCTGCTGCATC 
GCCAGTATCGAATTGCAAACCGAAATATCTACGACCGTGAGTACGTCCACGACGACGTAG 



420 



BseMII 

Hinfl BslI I Ddel Hinfl 

SfaNI Tfil Ddel | j BseMII | Tfil 

I I II I I I I 

TACAACAGCAAATAAGAATCTTACCTTCTCAGGGTTTTCCTTACTGAGTTTTGATTCCTC 

421 + + + + + + 480 

ATGTTGTCGTTTATTCTTAGAATGGAAGAGTCCCAAAAGGAATGACTCAAAACTAAGGAG 



Maelll 
Mnll I 
Bfal I Taal Sfcl 
II II 



Mnll 
BbvCI I 
BpulOI I 
XmnI Ddel I 
I I I 



Apol 
Tsp509I 
BseMII I 
Mnll I I 

I II 



TCCTAGCACAACGGTTACTACAGGTCAGGGAACGCTTTCCTCAGCAGGAGGCGTAAATTT 

481 + + + + + + 540 

AGGATCGTGTTGCCAATGATGTCCAGTCCCTTGCGAAAGGAGTCGTCCTCCGCATTTAAA 
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Bed 

Apol PstI I Hgal 

Sspl Cjel CviRI I I Alul I 

Xnml| Tsp509I Sfcl | | | Cvi JI | 

II I I II I II 

AGAAAATATTCGTAAACTTGTAGTTGCTGGGAATTTTTCTACTGCAGATGGTGGAGCTAT 

541 + --" + + + + + 600 

TCTTTTATAAGCATTTGAACATCAACGACCCTTAAAAAGATGACGTCTACCACCTCGATA 

Hpyl78III Bpml 
BsrI I MboII I 

Cjel Msel SfaNI | Maelll | | 

I III II I 

CAAAGGAGCGTCTTTCCTTTTAACTGGCACTTCTGGAGATGCTCTTTTTAGTAACAACTC 

601 + + + + + + 660 

GTTTCCTCGCAGAAAGGAAAATTGACCGTGAAGACCTCTACGAGAAAAATCATTGTTGAG 
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Fig. 2 C 



CacSI 

Mwol Haell| 
Mnll Muni BseRI | Hhal | | 

Earl I Tsp509I Sfcl | CacSI | j |MwoI 

II I II I III I 

TTCATCAACAAAGGGAGGAGCAATTGCTACTACAGCAGGCGCTCGCATAGCAAATAACAC 

661 + + + + + + 720 

AAGTAGTTGTTTCCCTCCTCGTTAACGATGATGTCGTCCGCGAGCGTATCGTTTATTGTG 

Clal 

Hgal Hpyl78III TaqI 

BsaBI I . Maell | Haell | 

Hinfl I I Mnll | Hhal | | 

Tf il I I Acci I I Hin4I | j | 

III III III I 

AGGTTATGTTAGATTCCTATCTAACATAGCGTCTACGTCAGGAGGCGCTATCGATGATGA 

721 + + + + + + 780 

TCCAATAC7VATCTAAGGATAGATTGTATCGCAGATGCAGTCCTCCGCGATAGCTACTACT 

Bsml 
Fnu4HI I 
CviRI I I 

TaqI Apol Tsel j j 

Maell I TaqI Tsp509I CjePI | | | 

I I I I I III 

AGGCACGTCGATACTATCGAACAACAAATTTCTATATTTTGAAGGGAATGCAGCGAAAAC 

781 + + + + + + 840 

TCCGTGCAGCTATGATAGCTTGTTGTTTAAAGATATAAAACTTCCCTTACGTCGCTTTTG 



9/77 



Fig. 2 D 



REPLACEMENT SHEET 
Title: CHLAMYDIA ANTIGENS AND 
CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/857,128 



BsaJI 
Styl 
Bsbl 
CjePI 
CviRI 



Cjel 
Dpnl 
Sau3AI 
Bsri I 
Acil| I 
Bbvl I I I 

III I 



Alwl 
Hpyl78III 
Dpnl 
BstYI I 
Hin4I I 
Sau3AI I 

I 



Cjel BplI 
I I 



841 



III t I I I I I I II II I I 

TACTGGCGGTGCGATCTGCAACACCAAGGCGAGTGGATCTCCTGAACTGATAATCTCTAA 

ATGACCGCCACGCTAGACGTTGTGGTTCCGCTCACCTAGAGGACTTGACTATTAGAGATT 



-+ 900 



Dpnl 
Sau3AI I 
Hpyl88IX| I 
Plel Hinf I I I I 
I II I 



Cjel 
Maell I 
I 



Acil 
Fokl I 

I I 



Bed 
Haell 
Hhal 
NlalV 
BsaHI I 
Narl I 
Bani I I 
Ml 



Nlalll 
Mwol I 
I 



960 



I I I I I II II I I I II I I I 

CAATAAGACTCTGATCTTTGCTTCAAACGTAGCAGAAACAAGCGGTGGCGCCATCCATGC 

GTTATTCTGAGACTAGAAACGAAGTTTGCATCGTCTTTGTTCGCCACCGCGGTAGGTACG 

CviJI 
Cac8I 
Bf al 
Alul I , 

CviJI I j j Hpyl78III Mnll Pf 111081 

Cjel Nheljj I Mnll | CviJI | Bpml | BsmAI 

I III I I I II II I 
TAAAAAGCTAGCCCTTTCCTCTGGAGGCTTTACAGAGTTTCTACGAAATAATGTCTCATC 
961 + + + + + + 1020 

ATTTTTCGATCGGGAAAGGAGACCTCCGAAATGTCTCAAAGATGCTTTATTACAGAGTAG 



BSU36I 
Ddel 



BslI 



Sf aNI 



Clal 
TaqI 
I 



BseMII 
Banll 
BsiHKAI 
Bspl286I 
Sad 

Hpyl78III Ddel I 
BSU36I I Alul I I I CviRI 
Ddel i CviJI I i I Sf cl I 
SfaNl| I Mnll 11 | BsmAI | j 
II I III I II I 



AGCAACTCCTAAGGGGGGTGCTATCAGCATCGATGCCTCAGGAGAGCTCAGTCTTTCTGC 



1021 -- 



+ + 1080 



TCGTTGAGGATTCCCCCCACGATAGTCGTAGCTACGGAGTCCTCTCGAGTCAGAAAGACG 
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Fig. 2 E 

BseMII Mspl BsaXI 

PstI BsaWI I Rsal | 

I II I I 

AGAGACAGGAAACATTACCTTTGTAAGAAATACCCTTACAACAACCGGAAGTACCGATAC 

1081 + + + + + + 1140 

TCTCTGTCCTTTGTAATGGAAACATTCTTTATGGGAATGTTGTTGGCCTTCATGGCTATG 

Tsp509I 

Sthl32I I Fnu4HI 

Apol I I Cvi JI I 

Dpnl Maelll Tsp509I | | Tsel | 

Maell Sau3AI | Sthl32I | BscGI | Bbvl | | BscGI || 

I II II I I II I I II 

TCCTAAACGTAATGCGATCAACATAGGAAGTAACGGGAAATTCACGGAATTACGGGCTGC 

1141 + + + + + + 1200 

AGGATTTGCATTACGCTAGTTGTATCCTTCATTGCCCTTTAAGTGCCTTAATGCCCGACG 

Dpnl Mnll 

Sau3AI I Mae II I 

Tsp509I EC057I | j Hpyl88IX Hpyl88IX | | 

MboII I Alwl| I I Bed I NlalV I 11 

II II I I I I I IN 

TAAAAATCATACAATTTTCTTCTATGATCCCATCACTTCAGAAGGAACCTCATCAGACGT 

1201 + + + + + + 1260 

ATTTTTAGTATGTTAAAAGAAGATACTAGGGTAGTGAAGTCTTCCTTGGAGTAGTCTGCA 

Banll 
Beef I 
BsiHKAI 
CviJI Bspl286I 
Faul I SacI 
MboII I Alul I Hinfl 

Sthl32l| I Acil CviJI I BslI Tfil 

III I I I I I 

ATTGAAGATAAATAACGGCTCTGCGGGAGCTCTCAATCCATATCAAGGAACGATTCTATT 

1261 + + + + + + 1320 

TAACTTCTATTTATTGCCGAGACGCCCTCGAGAGTTAGGTATAGTTCCTTGCTAAGATAA 

Dral 
Msel I 
MboII I I 

Hpyl78III Bpml Msel Tsp509I | j | 

I II MM 

TTCTGGAGAAACCCTAACAGCAGATGAACTTAAAGTTGCTGACAATTTAAAATCTTCATT 

1321 + + + + + + 1380 

AAGACCTCTTTGGGATTGTCGTCTACTTGAATTTCAACGACTGTTAAATTTTAGAAGTAA 
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BsrI 
CviJI I 
Fnu4HI I I 
Tsell II 



II II 



Hpyl78III 

Mspl I 
BsaWI I 
BspEI 
Mnll I 
Bbvl I I 
BsmAI I I 
I I I 



Maelll 
Tsp4 5I 
Hinf I I 

I I 



Bce83I 
Plel I 



CACGCAGCCAGTCTCCCTATCCGGAGGAAAGTTATTGCTACAAAAGGGAGTCACTTTAGA 

1381 + + + + + + 1440 

GTGCGTCGGTCAGAGGGATAGGCCTCCTTTCAATAACGATGTTTTCCCTCAGTGAAATCT 



Mspl 
BseRI 
BsrFI 
CviJI 
Haelll 
Hpyl7 8III 

Smll 
Mnll 



Hin4I 
Alul 
CviJI 
BsiHKAI I 
Bspl286I I 
BssSl| I 
I I I 



Hpyl78III 
Hinfl 
Mnll 
Tf il 

BsaJI Nlallll 
BplI I BslI I I I Cjel 

II I II I I 

GAGCACGAGCTTCTCTCAAGAGGCCGGTTCTCTCCTCGGCATGGATTCAGGAACGACATT 

1441 + + + + + + 1500 

CTCGTGCTCGAAGAGAGTTCTCCGGCCAAGAGAGGAGCCGTACCTAAGTCCTTGCTGTAA 



Hinfl 

Alul Bfal I 

CviJI Avrll| I 

MspAlI BsaJI I I 

PvuII Cjel II I 

Sfcl I Hpyl78III Styl|TfiI Plel 

II I II I I 

ATCAACTACAGCTGGGAGTATTACAATCACGAACCTAGGAATCAATGTTGACTCCTTAGG 

1501 + + + + + + 1560 

TAGTTGATGTCGACCCTCATAATGTTAGTGCTTGGATCCTTAGTTACAACTGAGGAATCC 



BSU36I 
Hinfl I 
Hindi I I 
I I Ddel 
I I I 



Bbvl 
CviJI 
Sthl32I 
BscGI 
CviJI 
Fnu4HI 
Msel I 

Af III I ill II BstAPI 

Smll|Tsel| II II Mwol CjePI 

II II I I II I I 

TCTTAAGCAGCCCGTCAGCCTAACAGCAAAAGGTGCTTCAAATAAAGTGATCGTATCTGG 

1561 + + + + + + 1620 

AGAATTCGTCGGGCAGTCGGATTGTCGTTTTCCACGAAGTTTATTTCACTAGCATAGACC 



Dpnl 
Sau3AI I 
I I 




12/77 



Fig. 2 G 



Alul 
CviJI 



CjePI 
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Nlalll 
MboII I 
CviJI I I 
Ndel CjePI | | | 
I II II 



GAAGCTCAACCTGATTGATATTGAAGGGAACATTTATGAAAGTCATATGTTCAGCCATGA 

1621 + + + + + + 1680 

CTTCGAGTTGGACTAACTATAACTTCCCTTGTAAATACTTTCAGTATACAAGTCGGTACT 



Msel 

Acelll I Taal 
Alul Earl | | CjePI | 

CviJI Sapl I jsfaNI | j 
I III III 



Bael 



Hindi 
Acll I 
Maell j 



CCAGCTCTTCTCTCTATTAAAAATCACGGTTGATGCTGATGTTGATACTAACGTTGACAT 

1681 + + + + + + 1740 

GGTCGAGAAGAGAGATAATTTTTAGTGCCAACTACGACTACAACTATGATTGCAACTGTA 



HpylSSIX 
Alwl 
Tsp509I 
Dpnl 
NlalV 
BamHI 



BstYI 
Sau3AI 
BbvCI 

Bael BpulOI 
CviJI I Mnll I 

Fnu4HI I I Bbvl | Ddel 
Tsel| I iBseMII I Alwl I 

III I I I II 



BsaJI 
Styl 
Hinfl I 
Tf il I 
I I 



CAGCAGCCTTATCCCTGTTCCTGCTGAGGATCCTAATTCAGAATACGGATTCCAAGGACA 

1741 + + + + + + 1800 

GTCGTCGGAATAGGGACAAGGACGACTCCTAGGATTAAGTCTTATGCCTAAGGTTCCTGT 



BsaJI 
BstDSI 
CviJI I 

Alul Haelj Avail 

CviJI Mnll Haelllj Sau96I 

I I II I 

ATGGAATGTTAATTGGACTACGGATACAGCTACAAATACAAAAGAGGCCACGGCAACTTG 

1801 + + + + + + 1860 

TACCTTACAATTAACCTGATGCCTATGTCGATGTTTATGTTTTCTCCGGTGCCGTTGAAC 



Tsp509I 

Msel I BciVI 



Bcefl CviJI Sthl32I CviRI 

I II I 

GACCAAAACAGGATTTGTTCCCAGCCCCGAAAGAAAATCTGCGTTAGTATGCAATACCCT 

1861 + + + + + + 1920 

CTGGTTTTGTCCTAAACAAGGGTCGGGGCTTTCTTTTAGACGCAATCATACGTTATGGGA 



Fig. 2 H 
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Dpnl 

Hinfl Plel Hin4I CviRI CviJI Sau3AI | Hhal 

I II II III 

ATGGGGAGTCTTTACTGACATTCGCTCTCTGCAACAGCTTGTAGAGATCGGCGCAACTGG 

1921 + + + + + + 1980 

TACCCCTCAGAAATGACTGTAAGCGAGAGACGTTGTCGAACATCTCTAGCCGCGTTGACC 



Bcgl Mnll 
Bsri Tthlllll I Nlalll | CviRI 

I I I I I I 

TATGGAACACAAACAAGGTTTCTGGGTTTCCTCCATGACG7VACTTCCTGCATAAGACTGG 

1981 + + + + + + 2040 

ATACCTTGTGTTTGTTCCAAAGACCCAAAGGAGGTACTGCTTGAAGGACGTATTCTGACC 



Mae 1 1 
BsaXI 

BsaBI Hin4I 
Bcgl I Bpml Hpyl78III Mnll 

Bsrl| I CviJI Mnll | CviJI | | Bpml 

Ml I I I II I I 

AGATGAAAATCGCAAAGGCTTCCGTCATACCTCTGGAGGCTACGTCATCGGTGGAAGTGC 

2041 + + + + + + 2100 

TCTACTTTTAGCGTTTCCGAAGGCAGTATGGAGACCTCCGATGCAGTAGCCACCTTCACG 



BsiHKAI Bfal 
Bspl286I Bed BsmAI | 

I III 
TCACACTCCTAAAGACGACCTATTTACCTTTGCGTTCTGCCATCTCTTTGCTAGAGACAA 

2101 + + + + + + 2160 

AGTGTGAGGATTTCTGCTGGATAAATGGAAACGCAAGACGGTAGAGAAACGATCTCTGTT 

Hpyl7 8III 
Bfal I 

Xbal I I MboII 
Cjel I I I Taal | Cjel 

I III II I 

AGATTGTTTTATCGCTCACAACAACTCTAGAACCTACGGTGGAACTTTATTCTTCAAGCA 



2161 -- 



---+ 2220 



TCTAACAAAATAGCGAGTGTTGTTGAGATCTTGGATGCCACCTTGAAATAAGAAGTTCGT 



Hinfl 
Tf il 

Earl Hpyl8 8IX | 

Tthlllll Sapl MboII | | 

I I III 

CTCTCATACCCTACAACCCCAAAACTATTTGAGATTAGGAAGAGCAAAGTTTTCTGAATC 

2221 + + + + + + 2280 

GAGAGTATGGGATGTTGGGGTTTTGATAAACTCTAATCCTTCTCGTTTCAAAAGACTTAG 



MAR » 8 2004 
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Sfcl 
Alul I 
Cvi JI I 

II 



2281 



BslI 
Cjel 
BsaJI 
Styl 
Cvi JI I 
Bfal I I 

...... I II. , . 

AGCTATAGAAAAATTCCCTAGGGAAATTCCCCTAGCCTTGGATGTCCAAGTTTCGTTCAG 

TCGATATCTTTTTAAGGGATCCCTTTAAGGGGATCGGAACCTACAGGTTCAAAGCAAGTC 



Apol 
Tsp509I 



Apol 
Tsp509I 
Bfal 
Avrll I 
BsaJI I 
Styl I 
I 



Fokl CviJI 



Hpyl8 8IX 
Hinf I 
Tf il 

Taal Mnll I 

Hpyl88IX Cjel | BsrDI CjePI | | 

III I I II 

CCATTCAGACAACCGTATGGAAACGCACTATACCTCATTGCCAGAATCCGAAGGTTCTTG 

2341 + + + + + + 2400 

GGTAAGTCTGTTGGCATACCTTTGCGTGATATGGAGTAACGGTCTTAGGCTTCCAAGAAC 



Alul Bfal 
CviJI CviJI I Fokl 

CjePI I Haelll | CjePI | 

II II II 

GAGCAACGAGTGTATAGCTGGTGGTATCGGCCTAGACCTTCCTTTTGTTCTTTCCAACCC 

2401 + + + + + + 2460 

CTCGTTGCTCACATATCGACCACCATAGCCGGATCTGGAAGGAAAACAAGAAAGGTTGGG 

CjePI 
Mmel I 
Mnll I 
RleAI I 

Hpyl78III I I I Taqi 
II 

ACATCCTCTTTTCAAGACCTTCATTCCACAGATGAAAGTCGAAATGGTTTATGTATCACA 



2461 -• 



+ + 2520 



TGTAGGAGAAAAGTTCTGGAAGTAAGGTGTCTACTTTCAGCTTTACCAAATACATAGTGT 



Alul 
CviJI 
MboII I 
I I 



Bfal 
Alul 
CviJI 
Bcefl I 
NspV I I 
TaqI I I 
I I I 



BsaJI 
BstDSI 
CviJI 
Haelll 
Bed I 
Eael I 
Gdill I 
I I 



Bbvl 



Fnu4HI 
CviJI I 
Tsel I Msel 
I I I 



AAATAGCTTCTTCGAAAGCTCTAGTGATGGCCGTGGTTTTAGTATTGGAAGGCTGCTTAA 

2521 + + + + + + 2580 

TTTATCGAAGAAGCTTTCGAGATCACTACCGGCACCAAAATCATAACCTTCCGACGAATT 
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J 

Mnll 
Bcgl I 



Hinf I I I Bsgl 

Tfil I i Hinf I I 

Hpyl78III I I I Tfil I 

TaqI I I I Bcgl | | 

Taqll I I I Apol Hpyl88IX| | | 

RleAI II II Tsp50 9I CviRI EcoRV | | | | 

II I II I I I II I I 

CCTCTCGATTCCTGTGGGTGCGAAATTCGTGCAGGGGGATATCGGAGATTCCTACACCTA 

2581 + + + + + + 2640 



GGAGAGCTAAGGACACCCACGCTTTAAGCACGTCCCCCTATAGCCTCTAAGGATGTGGAT 




Hinf I 
Tfil 
Hpyl78III I 

Dpnl I I Hpyl88IX Maelll Plel 

Sau3AI |DdeI | | BseMII | BsaBI | Sf cl | Hinf I 

I I I II I I I I III 

TGATCTCTCAGGATTCTTTGTTTCCGATGTCTATCGTAACAATCCCCAATCTACAGCGAC 

2641 + + + + + + 2700 

ACTAGAGAGTCCTAAGAAACAAAGGCTACAGATAGCATTGTTAGGGGTTAGATGTCGCTG 



Apol 
Tsp509I 
BstXI 
Hinf I 
Banll 
Bspl286I 
CviJI I 
Plel I I 
II I 



Acil 
Thai 



Mnll 



Hpyl78III BseMII 



I 



I I 



TCTTGTGATGAGCCCAGACTCTTGGAAAATTCGCGGTGGCAATCTTTCAAGACAGGCATT 

2701 + + + + + + 2760 

AGAACACTACTCGGGTCTGAGAACCTTTTAAGCGCCACCGTTAGAAAGTTCTGTCCGTAA 



Earl 
Sapl 
Hpyl8 8IX I 
Banll I I 
BsiHKAI I I 
Bspl286I I I 
MboII Sad I I 
AccI Muni I Alul I I I 

Ddel Maell | Tsp509I j CviJI j | | 

I II I I II II 

TTTACTGAGGGGTAGCAACAACTACGTCTACAACTCCAATTGTGAGCTCTTCGGACATTA 

2761 + + + + + + 2820 

AAATGACTCCCCATCGTTGTTGATGCAGATGTTGAGGTTAACACTCGAGAAGCCTGTAAT 
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Hinf I 
Tf il 
Hpyl8 8IX 
Kpnl 
NlalV I 
Rsal I 

Alwl BanI I I 

I III 

CGCTATGGAACTCCGTGGATCTTCAAGGAACTACAATGTAGATGTTGGTACCAAACTCCG 

2821 + + + + + + 2880 

GCGATACCTTGAGGCACCTAGAAGTTCCTTGATGTTACATCTACAACCATGGTTTGAGGC 



Dpnl 
BstYI 
Sau3AI 
BsaJI I 
BstDSI I 
Mbollj I 
II I 



Hpyl78III 

Bfal| Bfal 
Xbal I I MboII I Bfal 

III III 
ATTCTAGATTGCTAAAACTCCCTAGTTCTTCTAGGGAGTTTTCTCATACTTTTAGGGAAA 

2881 + + + + + + 2940 

TAAGATCTAACGATTTTGAGGGATCAAGAAGATCCCTCAAAAGAGTATGAAAATCCCTTT 

Sspl 



TATTTGCTAT 

2941 + 2950 

ATAAACGATA 
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Figure 3 A: Nucleotide and deduced amino acid sequence of CPN100635 

agttaaagat gacaaaacag ctgtcaagaa tttttatctt gactctctga gttttctatt 60 

ttatatgacg caagtaagaa tttaataata aagtgggttt atg aaa teg caa ttt 115 

Met Lys Ser Gin Phe 
1 5 



tec 


tgg 


4- ^ ^ 

uca 


gtg 


etc 


uCu 


teg 


aca 


ttg 


gea 


tgt 


4-4-4- 


ae u 


agt 


tgt 


tee 


Ser 


Trp 


Leu 


Val 


Leu 


Ser 


Ser 


Thr 


Leu 


Ala 


Cys 


Phe 


Thr 


Ser 


Cys 


Ser 










10 










15 










20 




act 


gtt 


ttt 


get 


gca 


act 


get 


gaa 


aat 


ata 


ggc 


ccc 


tct 


gat 


age 


ttt 


Thr 


Val 


Phe 


Ala 


Ala 


Thr 


Ala 


Glu 


Asn 


He 


Gly 


Pro 


Ser 


Asp 


Ser 


Phe 








25 










30 










35 






gac 


gga 


agt 


act 


aac 


aca 


gge 


acc 


tat 


act 


cct 


aaa 


aat 


aeg 


act 


act 


Asp 


Gly 


Ser 


Thr 


Asn 


Thr 


Gly 


Thr 


Tyr 


Thr 


Pro 


Lys 


Asn 


Thr 


Thr 


Thr 














Gly 


Thr 


Tyr 


Thr 


Pro 


Lys 


Asn 


Thr 


Thr 


Thr 






40 










45 










50 








gga 


ata 


gac 


tat 


act 


ctg 


aca 


gga 


gat 


ata 


act 


ctg 


caa 


aac 


ctt 


ggg 


Gly 


He 


Asp 


Tyr 


Thr 


Leu 


Thr 


Gly 


Asp 


He 


Thr 


Leu 


Gin 


Asn 


Leu 


Gly 


Gly 


He 


Asp 


Tyr 


Thr 


Leu 


Thr 


Gly 


Asp 


He 


Thr 


Leu 


Gin 


Asn 


Leu 


Gly 




55 










60 










65 










gat 


teg 


gca 


get 


tta 


aeg 


aag 


ggt 


tgt 


ttt 


tct 


gac 


act 


aeg 


gaa 


tct 


Asp 


Ser 


Ala 


Ala 


Leu 


Thr 


Lys 


Gly 


Cys 


Phe 


Ser 


Asp 


Thr 


Thr 


Glu 


Ser 


Asp 


Ser 


Ala 


Ala 


Leu 


Thr 


Lys 


Gly 


Cys 


Phe 


Ser 


Asp 


Thr 


Thr 


Glu 


Ser 



163 



211 



259 



307 



355 



70 



75 



80 



85 



tta 


age 


ttt 


gee 


ggt 


aag 


ggg 


tac 


tea 


ctt 


tct 


ttt 


tta 


aat 


att 


aag 


403 


Leu 


Ser 


Phe 


Ala 


Gly 


Lys 


Gly 


Tyr 


Ser 


Leu 


Ser 


Phe 


Leu 


Asn 


He 


Lys 




Leu 


Ser 


Phe 


Ala 


Gly 


Lys 


Gly 


Tyr 


Ser 


Leu 


Ser 


Phe 


Leu 


Asn 


He 


Lys 












90 










95 










100 







tct 


agt 


get 


gaa 


ggc 


gca 


gee 


ctt 


tct 


gtt 


aca 


act 


gat 


aaa 


aat 


ctg 


451 


Ser 


Ser 


Ala 


Glu 


Gly 


Ala 


Ala 


Leu 


Ser 


Val 


Thr 


Thr 


Asp 


Lys 


Asn 


Leu 




Ser 


Ser 


Ala 


Glu 


Gly 


Ala 


Ala 


Leu 


Ser 


Val 


Thr 


Thr 


Asp 


Lys 


Asn 


Leu 










105 










110 










115 









teg 


eta 


aca 


gga 


ttt 


teg 


agt 


ctt 


act 


tte 


tta 


gcg 


gee 


eca 


tea 


teg 


Ser 


Leu 


Thr 


Gly 


Phe 


Ser 


Ser 


Leu 


Thr 


Phe 


Leu 


Ala 


Ala 


Pro 


Ser 


Ser 


Ser 


Leu 


Thr 


Gly 


Phe 


Ser 


Ser 


Leu 


Thr 


Phe 


Leu 


Ala 


Ala 


Pro 


Ser 


Ser 






120 










125 










130 









499 



gta 


ate 


aca 


acc 


ccc 


tea 


gga 


aaa 


ggt 


gca 


gtt 


aaa 


tgt 


gga 


ggg 


gat 


Val 


He 


Thr 


Thr 


Pro 


Ser 


Gly 


Lys 


Gly 


Ala 


Val 


Lys 


Cys 


Gly 


Gly 


Asp 


Val 


He 


Thr 


Thr 


Pro 


Ser 


Gly 


Lys 


Gly 


Ala 


Val 


Lys 


Cys 


Gly 


Gly 


Asp 




135 










140 










145 











547 



Ctt 


aca 


ttt 


gat 


aac 


aat 


gga 


act 


att 


tta 


ttt 


aaa 


caa 


gat 


tac 


tgt 


595 


Leu 


Thr 


Phe 


Asp 


Asn 


Asn 


Gly 


Thr 


He 


Leu 


Phe 


Lys 


Gin 


Asp 


Tyr 


Cys 




Leu 


Thr 


Phe 


Asp 


Asn 


Asn 


Gly 


Thr 


He 


Leu 


Phe 


Lys 


Gin 


Asp 


Tyr 


Cys 




150 










155 










160 










165 
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gag 


gaa 


aat 


ggc 


gga 


gcc 


att 


tct 


acc 


aag 


aat 


ctt 


tct 


ttg 


aaa 


aac 


643 


Glu 


Glu 


Asn 


Gly 


Gly 


Ala 


He 


Ser 


Thr 


Lys 


Asn 


Leu 


Ser 


Leu 


Lys 


Asn 




Glu 


Glu 


Asn 


Gly 


Gly 


Ala 


He 


Ser 


Thr 


Lys 


Asn 


Leu 


Ser 


Leu 


Lys 


Asn 












170 










175 










180 







age 


acg 


gga 


teg 


att 


tct 


ttt 


gaa 


ggg 


aat 


aaa 


teg 


age 


gca 


aea 


ggg 


691 


Ser 


Thr 


Gly 


Ser 


He 


Ser 


Phe 


Glu 


Gly 


Asn 


Lys 


Ser 


Ser 


Ala 


Thr 


Gly 




Ser 


Thr 


Gly 


Ser 


He 


Ser 


Phe 


Glu 


Gly 


Asn 


Lys 


Ser 


Ser 


Ala 


Thr 


Gly 










185 










190 










195 









aaa 


aaa 


ggt 


ggg 


get 


att 


tgt 


get 


act 


ggt 


act 


gta 


gat 


att 


aea 


aat 


739 


Lys 


Lys 


Gly 


Gly 


Ala 


He 


Cys 


Ala 


Thr 


Gly 


Thr 


Val 


Asp 


He 


Thr 


Asn 




Lys 


Lys 


Gly 


Gly 


Ala 


He 


Cys 


Ala 


Thr 


Gly 


Thr 


Val 


Asp 


He 


Thr 


Asn 








200 










205 










210 











aat 


acg 


get 


cet 


acc 


etc 


ttc 


teg 


aac 


aat 


att 


get 


gaa 


get 


gca 


ggt 


787 


Asn 


Thr 


Ala 


Pro 


Thr 


Leu 


Phe 


Ser 


Asn 


Asn 


He 


Ala 


Glu 


Ala 


Ala 


Gly 




Asn 


Thr 


Ala 


Pro 


Thr 


Leu 


Phe 


Ser 


Asn 


Asn 


He 


Ala 


Glu 


Ala 


Ala 


Gly 






215 










220 










225 













gga 


get 


ata 


aat 


age 


aca 


gga 


aac 


tgt 


aea 


att 


aea 


ggg aat 


acg 


tct 


835 


Gly 


Ala 


He 


Asn 


Ser 


Thr 


Gly 


Asn 


Cys 


Thr 


He 


Thr 


Gly Asn 


Thr 


Ser 




Gly 


Ala 


He 


Asn 


Ser 


Thr 


Gly 


Asn 


Cys 


Thr 


He 


Thr 


Gly Asn 


Thr 


Ser 




230 










235 










240 








245 





ctt 


gta 


ttt 


tet 


gaa 


aat 


agt 


gtg 


aca 


gcg 


acc 


gca 


gga 


aat 


gga 


gga 


883 


Leu 


Val 


Phe 


Ser 


Glu 


Asn 


Ser 


Val 


Thr 


Ala 


Thr 


Ala 


Gly 


Asn 


Gly 


Gly 




Leu 


Val 


Phe 


Ser 


Glu 


Asn 


Ser 


Val 


Thr 


Ala 


Thr 


Ala 


Gly 


Asn 


Gly 


Gly 












250 










255 










260 







get 


ctt 


tct 


gga 


gat 


gcc 


gat 


gtt 


ace 


ata 


tet 


ggg 


aat 


cag 


agt 


gta 


931 


Ala 


Leu 


Ser 


Gly 


Asp 


Ala 


Asp 


Val 


Thr 


He 


Ser 


Gly 


Asn 


Gin 


Ser 


Val 




Ala 


Leu 


Ser 


Gly 


Asp 


Ala 


Asp 


Val 


Thr 


He 


Ser 


Gly 


Asn 


Gin 


Ser 


Val 










265 










270 










275 









act 


ttc 


tea 


gga 


aac 


eaa 


get 


gta 


get 


aat 


ggc 


gga gee 


att 


tat 


get 


979 


Thr 


Phe 


Ser 


Gly 


Asn 


Gin 


Ala 


Val 


Ala 


Asn 


Gly 


Gly Ala 


He 


Tyr 


Ala 




Thr 


Phe 


Ser 


Gly 


Asn 


Gin 


Ala 


Val 


Ala 


Asn 


Gly 


Gly Ala 


He 


Tyr 


Ala 








280 










285 








290 











aag 


aag 


ctt 


aca 


ctg 


get 


tec 


ggg 


ggg 


ggg 


ggg 


ggg aat 


eec 


ttt 


tet 


1027 


Lys 


Lys 


Leu 


Thr 


Leu 


Ala 


Ser 


Gly 


Gly 


Gly 


Gly 


Gly Asn 


Pro 


Phe 


Ser 




Lys 


Lys 


Leu 


Thr 


Leu 


Ala 


Ser 


Gly 


Gly 


Gly 


Gly 


Gly Asn 


Pro 


Phe 


Ser 






295 










300 










305 











aac 


aat 


ata 


gtc 


caa 


ggt 


acc 


act 


gca 


ggt 


aat 


ggt gga gcc 


att 


tct 


1075 


Asn 


Asn 


He 


Val 


.Gin 


Gly 


Thr 


Thr 


Ala 


Gly 


Asn 


Gly Gly Ala 


He 


Ser 




Asn 


Asn 


He 


Val 


Gin 


Gly 


Thr 


Thr 


Ala 


Gly 


Asn 


Gly Gly Ala 


He 


Ser 




310 










315 










320 






325 





ata 


ctg 


gca 


get 


gga 


gag 


tgt 


agt 


ctt 


ttc 


age 


gaa 


gca 


ggg 


gae 


cat 


1123 


He 


Leu 


Ala 


Ala 


Gly 


Glu 


Cys 


Ser 


Leu 


Phe 


Ser 


Glu 


Ala 


Gly 


Asp 


His 




He 


Leu 


Ala 


Ala 


Gly 


Glu 


Cys 


Ser 


Leu 


Phe 


Ser 


Glu 


Ala 


Gly 


Asp 


His 












330 










335 










340 
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tac 


ctt 


aat 


ggg 


aat 


gcc 


att 


gtt 


gca 


act 


aca 


cca 


caa 


act 


aca 


aaa 


1171 


Tyr 


Leu 


Asn 


Gly 


Asn 


Ala 


He 


Val 


Ala 


Thr 


Thr 


Pro 


Gin 


Thr 


Thr 


Lys 




Tyr 


Leu 


Asn 


Gly 


Asn 


Ala 


He 


Val 


Ala 


Thr 


Thr 


Pro 


Gin 


Thr 


Thr 


Lys 










345 










350 










355 









aga 


aat 


tct 


att 


gac 


ata 


gga 


tct 


act 


ggc 


aaa 


gat 


cac 


gaa 


tta 


cgt 


1219 


Arg 


Asn 


Ser 


He 


Asp 


He 


Gly 


Ser 


Thr 


Gly 


Lys 


Asp 


His 


Glu 


Leu 


Arg 




Arg 


Asn 


Ser 


He 


Asp 


He 


Gly 


Ser 


Thr 


Gly 


Lys 


Asp 


His 


Glu 


Leu 


Arg 








360 










365 










370 











gca 


ata 


tct 


ggg 


cat 


age 


ate 


ttt 


ttc 


tac 


gat 


ccg 


att 


act 


get 


aat 


1267 


Ala 


He 


Ser 


Gly 


His 


Ser 


He 


Phe 


Phe 


Tyr 


Asp 


Pro 


He 


Thr 


Ala 


Asn 




Ala 


He 


Ser 


Gly 


His 


Ser 


He 


Phe 


Phe 


Tyr 


Asp 


Pro 


He 


Thr 


Ala 


Asn 






375 










380 










385 













aeg 


get 


geg 


gat 


tet 


aca 


gat 


act 


tta 


aat 


etc 


aat 


aag 


get 


gat 


gca 


1315 


Thr 


Ala 


Ala 


Asp 


Ser 


Thr 


Asp 


Thr 


Leu 


Asn 


Leu 


Asn 


Lys 


Ala 


Asp 


Ala 




Thr 


Ala 


Ala 


Asp 


Ser 


Thr 


Asp 


Thr 


Leu 


Asn 


Leu 


Asn 


Lys 


Ala 


Asp 


Ala 




390 










395 










400 










405 





ggt 


aat 


agt 


aca 


gat 


tat 


agt 


ggg 


teg 


att 


gtt 


ttt 


tct 


ggt 


gaa 


aag 


1363 


Gly 


Asn 


Ser 


Thr 


Asp 


Tyr 


Ser 


Gly 


Ser 


He 


Val 


Phe 


Ser 


Gly 


Glu 


Lys 




Gly 


Asn 


Ser 


Thr 


Asp 


Tyr 


Ser 


Gly 


Ser 


He 


Val 


Phe 


Ser 


Gly 


Glu 


Lys 












410 










415 










420 







etc 


tct 


gaa 


gat 


gaa 


gca 


aaa 


gtt 


gca 


gac 


aac 


etc 


act 


tet 


acg 


ctg 


1411 


Leu 


Ser 


Glu 


Asp 


Glu 


Ala 


Lys 


Val 


Ala 


Asp 


Asn 


Leu 


Thr 


Ser 


Thr 


Leu 




Leu 


Ser 


Glu 


Asp 


Glu 


Ala 


Lys 


Val 


Ala 


Asp 


Asn 


Leu 


Thr 


Ser 


Thr 


Leu 










425 










430 










435 









aag 


cag 


cct 


gta 


act 


eta 


act 


gca 


gga 


aat 


tta 


gta 


ctt 


aaa 


cgt 


ggt 


1459 


Lys 


Gin 


Pro 


Val 


Thr 


Leu 


Thr 


Ala 


Gly 


Asn 


Leu 


Val 


Leu 


Lys 


Arg 


Gly 




Lys 


Gin 


Pro 


Val 


Thr 


Leu 


Thr 


Ala 


Gly 


Asn 


Leu 


Val 


Leu 


Lys 


Arg 


Gly 








440 










445 










450 











gtc 


act 


etc 


gat 


aeg 


aaa 


ggc 


ttt 


act 


cag 


ace 


geg 


ggt 


tec 


tct 


gtt 


1507 


Val 


Thr 


Leu 


Asp 


Thr 


Lys 


Gly 


Phe 


Thr 


Gin 


Thr 


Ala 


Gly 


Ser 


Ser 


Val 




Val 


Thr 


Leu 


Asp 


Thr 


Lys 


Gly 


Phe 


Thr 


Gin 


Thr 


Ala 


Gly 


Ser 


Ser 


Val 






455 










460 










465 













att 


atg 


gat 


geg 


ggc 


aca 


acg 


tta 


aaa 


gca 


agt 


aca 


gag 


gag 


gtc 


act 


1555 


He 


Met 


Asp 


Ala 


Gly 


Thr 


Thr 


Leu 


Lys 


Ala 


Ser 


Thr 


Glu 


Glu 


Val 


Thr 




He 


Met 


Asp 


Ala 


Gly 


Thr 


Thr 


Leu 


Lys 


Ala 


Ser 


Thr 


Glu 


Glu 


Val 


Thr 




470 










475 










480 










485 





tta 


aca 


ggt 


ctt 


tec 


att 


cct 


gta 


gac 


tct 


tta 


ggc 


gag 


ggt 


aag 


aaa 


1603 


Leu 


Thr 


Gly 


Leu 


Ser 


He 


Pro 


Val 


Asp 


Ser 


Leu 


Gly 


Glu 


Gly 


Lys 


Lys 




Leu 


Thr 


Gly 


Leu 


Ser 


He 


Pro 


Val 


Asp 


Ser 


Leu 


Gly 


Glu 


Gly 


Lys 


Lys 












490 










495 










500 







gtt 


gta 


att 


get 


get 


tct 


gca 


gca 


agt 


aaa 


aat 


gta 


gee 


ctt 


agt 


ggt 


1651 


Val 


Val 


He 


Ala 


Ala 


Ser 


Ala 


Ala 


Ser 


Lys 


Asn 


Val 


Ala 


Leu 


Ser 


Gly 




Val 


Val 


He 


Ala 


Ala 


Ser 


Ala 


Ala 


Ser 


Lys 


Asn 


Val 


Ala 


Leu 


Ser 


Gly 










505 










510 










515 
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ccg 


att 


ctt 


ctt 


ttg 


gat 


aac 


caa ggg 


aat 


get 


tat 


gaa 


aat 


cac 


gac 


1699 


Pro 


He 


Leu 


Leu 


Leu 


Asp 


Asn 


Gin Gly 


Asn 


Ala 


Tyr 


Glu 


Asn 


His 


Asp 




Pro 


He 


Leu 


Leu 


Leu 


Asp 


Asn 


Gin Gly 


Asn 


Ala 


Tyr 


Glu 


Asn 


His 


Asp 








520 










525 








530 











tta 


gga 


aaa 


act 


caa 


gac 


ttt 


tea 


ttt 


gtg 


cag 


etc 


tct 


get 


etg 


ggt 


1747 


Leu 


Gly 


Lys 


Thr 


Gin 


Asp 


Phe 


Ser 


Phe 


Val 


Gin 


Leu 


Ser 


Ala 


Leu 


Gly 




Leu 


Gly 


Lys 


Thr 


Gin 


Asp 


Phe 


Ser 


Phe 


Val 


Gin 


Leu 


Ser 


Ala 


Leu 


Gly 






535 










540 










545 













act 


gca 


aca 


act 


aca 


gat 


gtt 


cca 


gcg 


gtt 


cct 


aca 


gta 


gca 


act 


cct 


Thr 


Ala 


Thr 


Thr 


Thr 


Asp 


Val 


Pro 


Ala 


Val 


Pro 


Thr 


Val 


Ala 


Thr 


Pro 


Thr 


Ala 


Thr 


Thr 


Thr 


Asp 


Val 


Pro 


Ala 


Val 


Pro 


Thr 


Val 


Ala 


Thr 


Pro 


550 










555 










560 










565 



aeg 


cac 


tat 


ggg 


tat 


caa 


ggt 


act 


tgg 


gga 


atg 


act 


tgg 


gtt 


gat 


gat 


Thr 


His 


Tyr 


Gly 


Tyr 


Gin 


Gly 


Thr 


Trp 


Gly 


Met 


Thr 


Trp 


Val 


Asp 


Asp 


Thr 


His 


Tyr 


Gly 


Tyr 


Gin 


Gly 


Thr 


Trp 


Gly 


Met 


Thr 


Trp 


Val 


Asp 


Asp 










570 










575 










580 





ace gca age act cca aag act aag 
Thr Ala Ser Thr Pro Lys Thr Lys 
Thr Ala Ser Thr Pro Lys Thr Lys 
585 



aca 


gcg 


aca 


tta 


get 


tgg 


acc 


aat 


Thr 


Ala 


Thr 


Leu 


Ala 


Trp 


Thr 


Asn 


Thr 


Ala 


Thr 


Leu 


Ala 


Trp 


Thr 


Asn 


590 










595 







aca 


ggc 


tac 


ctt 


ccg 


aat 


cct 


gag 


egt 


caa 


gga 


cct 


tta 


gtt 


cct 


aat 


1939 


Thr 


Gly 


Tyr 


Leu 


Pro 


Asn 


Pro 


Glu 


Arg 


Gin 


Gly 


Pro 


Leu 


Val 


Pro 


Asn 




Thr 


Gly 


Tyr 


Leu 


Pro 


Asn 


Pro 


Glu 


Arg 


Gin 


Gly 


Pro 


Leu 


Val 


Pro 


Asn 








600 










605 










610 











age 


ctt 


tgg 


gga 


tct 


ttt 


tea 


gac 


ate 


caa 


gcg 


att 


caa 


ggt 


gtc 


ata 


1987 


Ser 


Leu 


Trp 


Gly 


Ser 


Phe 


Ser 


Asp 


He 


Gin 


Ala 


He 


Gin 


Gly 


Val 


He 




Ser 


Leu 


Trp 


Gly 


Ser 


Phe 


Ser 


Asp 


He 


Gin 


Ala 


He 


Gin 


Gly 


Val 


He 






615 










620 










625 













gag 


aga 


agt 


get 


ttg 


act 


ctt 


tgt 


tea 


gat 


cga 


ggc 


tte 


tgg 


get 


gcg 


Glu 


Arg 


Ser 


Ala 


Leu 


Thr 


Leu 


Cys 


Ser 


Asp 


Arg 


Gly 


Phe 


Trp 


Ala 


Ala 


Glu 


Arg 


Ser 


Ala 


Leu 


Thr 


Leu 


Cys 


Ser 


Asp 


Arg 


Gly 


Phe 


Trp 


Ala 


Ala 


630 










635 










640 










645 



gga 


gtc 


gee 


aat 


tte 


tta 


gat 


aaa 


gat 


aag 


aaa 


ggg 


gaa 


aaa 


egc 


aaa 


Gly 


Val 


Ala 


Asn 


Phe 


Leu 


Asp 


Lys 


Asp 


Lys 


Lys 


Gly 


Glu 


Lys 


Arg 


Lys 


Gly 


Val 


Ala 


Asn 


Phe 


Leu 


Asp 


Lys 


Asp 


Lys 


Lys 


Gly 


Glu 


Lys 


Arg 


Lys 










650 










655 










660 





tac 


egt 


cat 


aaa 


tct 


ggt 


gga 


tat 


get 


ate 


gga 


ggt gca 


gcg 


caa 


act 


Tyr 


Arg 


His 


Lys 


Ser 


Gly 


Gly 


Tyr 


Ala 


He 


Gly 


Gly Ala 


Ala 


Gin 


Thr 


Tyr 


Arg 


His 


Lys 


Ser 


Gly 


Gly 


Tyr 


Ala 


He 


Gly 


Gly Ala 


Ala 


Gin 


Thr 








665 










670 








675 







tgt 


tct 


gaa 


aac 


tta 


att 


age 


ttt 


gee 


ttt 


tge 


caa 


etc 


ttt 


ggt 


age 


2179 


Cys 


Ser 


Glu 


Asn 


Leu 


He 


Ser 


Phe 


Ala 


Phe 


Cys 


Gin 


Leu 


Phe 


Gly 


Ser 




Cys 


Ser 


Glu 


Asn 


Leu 


He 


Ser 


Phe 


Ala 


Phe 


Cys 


Gin 


Leu 


Phe 


Gly 


Ser 








680 










685 










690 
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gat 
Asp 
Asp 


aaa 
Lys 
Lys 
695 


gat 
Asp 
Asp 


ttc 
Phe 
Phe 


tta 
Leu 
Leu 


gcc 
Ala 
Ala 
710 


ttc 

T>Vi A 

pne 
Phe 


tat 
Tyr 
Tyr 


ate 
x±e 
lie 


caa 
oXn 
Gin 


etc 
Leu 
Leu 


tta 
Leu 
Leu 


gat 
Asp 
Asp 


aaa 
Lys 
Lys 


ctt 
Leu 
Leu 
730 


gaa 
Glu 
Glu 


ggg 

Gly 
Gly 


cag 
Gin 
Gin 


etc 
Leu 
Leu 
745 


get 
Ala 
Ala 


tat 
Tyr 
Tyr 


act 
Thr 
Thr 


gcg 
Ala 
Ala 
760 


tat 
Tyr 
Tyr 


cct 
Pro 
Pro 


aac 
Asn 
Asn 


atg 
Met 
Met 
775 


atg 
Met 
Met 


ttg 
Leu 
Leu 


gga 
Gly 
Gly 


tgt 
Cys 
Cys 
790 


ttt 
Phe 
Phe 


gat 
Asp 
Asp 


ace 
Thr 
Thr 


tat 
Tyr 
Tyr 


cgt 
Arg 
Arg 


cag 
Gin 
Gin 


gac 
Asp 
Asp 


age 
Ser 
Ser 


ttc 
Phe 
Phe 
810 


gac 
Asp 
Asp 


age 
Ser 
Ser 


aae 
Asn 
Asn 


etc 
Leu 
Leu 
825 


ttc 
Phe 
Phe 


aag 
Lys 
Lys 


ttc 
Phe 
Phe 


tct 
Ser 
Ser 
840 


gat 
Asp 
Asp 


tgt 
Cys 
Cys 


gtt 
Val 
Val 


cct 
Pro 
Pro 


gat 
Asp 
Asp 


ctt 
Leu 
Leu 


ate 
He 
He 



855 
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gtc 
Val 
Val 


get 
Ala 
Ala 
700 


aaa 
Lys 
Lys 


aat 
Asn 
Asn 


cat 
His 
His 


act 
Thr 
Thr 


gat 
Asp 
Asp 
705 


cac 
His 
His 
715 


att 
He 
He 


aca 
Thr 
Thr 


gaa 
Glu 
Glu 


tgt 
Cys 
Cys 


agt 
Ser 
Ser 
720 


ggg 

Gly 
Gly 


cct 
Pro 
Pro 


ggc 
Gly 
Gly 


tct 
Ser 
Ser 


tgg 
Trp 
Trp 


agt 
Ser 
Ser 
735 


cat 
His 
His 


aaa 
Lys 
Lys 


tat 
Tyr 
Tyr 


age 
Ser 
Ser 


cac 
His 
His 


gtc 
Val 
Val 
750 


agt 
Ser 
Ser 


aat 
Asn 
Asn 


gat 
Asp 
Asp 


gag 
Glu 
Glu 


gtg 
Val 
Val 


aaa 
Lys 
Lys 
765 


ggt 
Gly 
Gly 


tct 
Ser 
Ser 


tgg 
Trp 
Tm 


ggg 

Gly 
Gly 


get 
Ala 
Ala 


tct 
Ser 
Ser 
780 


tct 
Ser 
Ser 


cat 
His 
His 


tct 
Ser 
Ser 


tat 
Tyr 
Tvr 

J. y J. 


cct 
Pro 
Pro 
785 


get 
Ala 
Ala 
795 


cea 
Pro 
Pro 


tae 
Tyr 
Tyr 


ate 
He 
He 


aaa 
Lys 
Lys 


ctg 
Leu 
Leu 
800 


aat 
Asn 
Asn 


teg 
Ser 
Ser 


gag 
Glu 
Glu 


aaa 
Lys 
Lys 


ggt 
Gly 
Gly 


aca 
Thr 
Thr 
815 


gaa 
Glu 
Glu 


gga 
Gly 
Gly 


aat 
Asn 
Asn 


tta 
Leu 
Leu 


tct 
Ser 
Ser 


ttg 
Leu 
Leu 
830 


cct 
Pro 
Pro 


ata 
He 
He 


ggg 

Gly 
Gly 


aat 
Asn 
Asn 


gac 
Asp 
Asp 


ttt 
Phe 
Phe 
845 


tct 
Ser 
Ser 


tat 
Tyr 
Tyr 


gat 
Asp 
Asp 


ctg 
Leu 
Leu 


cgc 
Arg 
Arg 


aat 
Asn 
Asn 
860 


gat 
Asp 
Asp 


ccc 
Pro 
Pro 


aaa 
Lys 
Lys 


tgc 
Cys 
Cys 


act 
Thr 
Thr 
865 
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ace 
Thr 
Thr 


tat 
Tyr 
Tyr 


gca 
Ala 
Ala 


gga 
Gly 
Gly 


2227 


ttc 
Phe 
Phe 


ata 
He 
He 


ggt 
Gly 
Gly 


tgt 

Cys 
Cys 
725 


2275 


ccc 
Pro 
Pro 


etc 
Leu 
Leu 


gtt 
Val 
Val 
740 


tta 
Leu 
Leu 


2323 


ctg 
Leu 
Leu 


aag 
Lys 
Lys 
755 


aca 
Thr 
Thr 


aag 
Lys 
Lys 


2371 


aat 
Asn 
Asn 
770 


aat 
Asn 
Asn 


get 
Ala 
Ala 


ttt 
Phe 
Phe 


2419 


gaa 
Glu 
Glu 


tae 
Tyr 
Tyr 


ctg 
Leu 
Leu 


cat 
His 
His 


2467 


ctg 
Leu 
Leu 


ace 
Thr 
Thr 


tat 
Tyr 
Tyr 


ata 
He 
He 
805 


2515 


aga 
Arg 
Arg 


tct 
Ser 
Ser 


ttt 
Phe 
Phe 
820 


gat 
Asp 
Asp 


2563 


gtg 
Val 
Val 


aag 
Lys 
Lys 
835 


ttt 
Phe 
Phe 


gag 
Glu 
Glu 


2611 


act 
Thr 
Thr 
850 


tta 
Leu 
Leu 


tec 
Ser 
Ser 


tat 
Tyr 
Tyr 


2659 


aca 
Thr 
Thr 


gca 
Ala 
Ala 


ctt 
Leu 
Leu 


gta 

Val 
Val 


2707 
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J 



ate 


age 


gga 


gcc 


tct 


tgg 


gaa 


act 


tat 


gcc 


aat 


aac 


tta 


gca 


cga 


cag 


2755 


He 


Ser 


Gly 


Ala 


Ser 


Trp 


Glu 


Thr 


Tyr 


Ala 


Asn 


Asn 


Leu 


Ala 


Arg 


Gin 




He 


Ser 


Gly 


Ala 


Ser 


Trp 


Glu 


Thr 


Tyr 


Ala 


Asn 


Asn 


Leu 


Ala 


Arg 


Gin 




870 










875 










880 










885 




gcc 


ttg 


caa 


gtg 


cgt 


gca 


ggc 


agt 


cac 


tac 


gcc 


ttc 


tct 


cct 


atg 


ttt 


2803 


Ala 


Leu 


Gin 


Val 


Arg 


Ala 


Gly 


Ser 


His 


Tyr 


Ala 


Phe 


Ser 


Pro 


Met 


Phe 




Ala 


Leu 


Gin 


Val 


Arg 


Ala 


Gly 


Ser 


His 


Tyr 


Ala 


Phe 


Ser 


Pro 


Met 


Phe 





890 895 900 
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gaa 


gtg 


etc 


ggc 


cag 


ttt 


gtc 


ttt 


gaa 


gtt 


cgt 


gga tec tea egg att 


2851 


Glu 


Val 


Leu 


Gly 


Gin 


Phe 


Val 


Phe 


Glu 


Val 


Arg 


Gly Ser Ser Arg He 




Glu 


Val 


Leu 


Gly 


Gin 


Phe 


Val 


Phe 


Glu 


Val 


Arg 


Gly Ser Ser Arg He 










905 










910 






915 




tat 


aat 


gta 


gat 


ctt 


ggg 


ggt 


aag 


ttc 


caa 


ttc 


taggagcgtc tctcatgtct 


2904 


Tyr 


Asn 


Val 


Asp 


Leu 


Gly 


Gly 


Lys 


Phe 


Gin 


Phe 






Tyr 


Asn 


Val 


Asp 


Leu 


Gly 


Gly 


Lys 


Phe 


Gin 


Phe 







920 925 



cagaaattct gagagagatc gcatttagga ttttcttaaa cacgac 



2950 
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Figure 4 A: Restriction enzyme analysis of CPNl 00635 
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Apol 
Tsp509I 
Hpyl78III 
Alul 
CviJI 
MspAlI 
PvuII 



Ddel 
Hpyl88IX 
Hinf I 
Hpyl78III I 
BseMIll I 

Msel PvuII I I Plel I I I 

I II I 

AGTTAAAGATGACAAAACAGCTGTCAAGAATTTTTATCTTGACTCTCTGAGTTTTCTATT 
TCAATTTCTACTGTTTTGTCGACAGTTCTTAAAAATAGAACTGAGAGACTCAAAAGATAA 



61 



ApoI ScrFI 
Tsp509I ECORII I 

Hgal I Msel Tsp509I [ | 

III III 
TTATATGACGCAAGTAAGAATTTAATAATAAAGTGGGTTTATGAAATCGCAATTTTCCTG 

AATATACTGCGTTCATTCTTAAATTATTATTTCACCCAAATACTTTAGCGTTAAAAGGAC 



120 



BsiHKAI 
Bspl286I 
MboII I 
I I 



Earl 
Taqi I 



Nlalll 
MslI Nspl 
I I 



Bfal 
Spel I 



Mwol 
CviRI 
Fnu4HI I 
TspRI I I 

Taal |Tsel| | 
II III 



121 



Bbvl 

I I I II I 

GTTAGTGCTCTCTTCGACATTGGCATGTTTTACTAGTTGTTCCACTGTTTTTGCTGCAAC 

CAATCACGAGAGAAGCTGTAACCGTACAAAATGATCAACAAGGTGACAAAAACGACGTTG 



180 



Hpyl88IX 
NlalV 
CviJI I 
Haelll I 
ECOO109I I I 
Sau96I I I 
III 



Mnll 
Alul I 
CviJI I 

II 



Rsal 
Seal 
TatI I 
I I 



Nlalv 
BanI |BsaXI 
I I I 



TGCTGAAAATATAGGCCCCTCTGATAGCTTTGACGGAAGTACTAACACAGGCACCTATAC 

181 + + + + + + 240 

ACGACTTTTATATCCGGGGAGACTATCGAAACTGCCTTCATGATTGTGTCCGTGGATATG 



Cjel 

Cjel Bsri Hpyl88IX CviRI 

I II I 

TCCTAAAAATACGACTACTGGAATAGACTATACTCTGACAGGAGATATAACTCTGCAAAA 

241 + + + + + + 300 

AGGATTTTTATGCTGATGACCTTATCTGATATGAGACTGTCCTCTATATTGAGACGTTTT 



-4 

MAR 0 B 2004 " 



24/77 



REPLACEMENT SHEET 
Title: CHLAMYDIA ANTIGENS AND 
CORRESPONDING ON A FRAGMENTS AND 
USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/857,128 



Msel 

Alul 
CviJI 

BsaJI Hinfl Fnu4HI | 
Styl Tfil Tsel| I 
I I III 



Alul 
CviJI 

Hinfl Hindlll | 
Bbvl Hpyl88IX Tfil Msel | | 

I I I I II 

CCTTGGGGATTCGGCAGCTTTAACGAAGGGTTGTTTTTCTGACACTACGGAATCTTTAAG 

301 + + + + + + 360 

GGAACCCCTAAGCCGTCGAAATTGCTTCCCAACAAAAAGACTGTGATGCCTTAGAAATTC 



Fnu4HI 
Hhal I 

Mspl Dral Msel Tsel | 

BsrFl| BslI Rsal Msel|SspI | Bfal Mwol || 

II I I II I I I I II 

CTTTGCCGGTAAGGGGTACTCACTTTCTTTTTTAAATATTAAGTCTAGTGCTGAAGGCGC 

361 + + + + + + 

GAAACGGCCATTCCCCATGAGTGAAAGAAAAAATTTATAATTCAGATCACGACTTCCGCG 



420 



EC057I 

Bbvl I Hinfl 
CviJI Maelllll BsaBI TaqI | Plel 

I III I III 

AGCCCTTTCTGTTACAACTGATAAAAATCTGTCGCTAACAGGATTTTCGAGTCTTACTTT 

TCGGGAAAGACAATGTTGACTATTTTTAGACAGCGATTGTCCTAAAAGCTCAGAATGAAA 



480 



Bed 
NlalV 
CviJI 
Haelll 
Sau96I 
Fnu4HI 



Taul III I Hpyl78III 

Acil I I jj I Cjel 

Cjel I 1 I I I I BSU36I | 

Ddel I I I I I I |bs1i Ddel j 

II Mill II II 
CTTAGCGGCCCCATCATCGGTAATCACAACCCCCTCAGGAAAAGGTGCAGTTAAATGTGG 

481 + + + + + + 

GAATCGCCGGGGTAGTAGCCATTAGTGTTGGGGGAGTCCTTTTCCACGTCAATTTACACC 



Mnll 
Msel 
BseMII I 
CviRI I I 
Mnll I j I 
I III 



540 



Bsgl 
Dpnl 
BstYI I 
Sau3AI I 



541 



Dral Taal 
Alwl Msel I Mnll I 

I II I I 

AGGGGATCTTACATTTGATAACAATGGAACTATTTTATTTAAACAAGATTACTGTGAGGA 

TCCCCTAGAATGTAAACTATTGTTACCTTGATAAAATAAATTTGTTCTAATGACACTCCT 



600 
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Clal 

CviJI TaqI 
NlalVl Dpnl I 

Ecil I I Hinf I Sau3AI | | 

Acil| II Tfil Sthl32I BscGl| || Alwl 

II II I I II II I 

AAATGGCGGAGCCATTTCTACCAAGAATCTTTCTTTGAAAAACAGCACGGGATCGATTTC 

601 + + + + + + 660 

TTTACCGCCTCGGTAAAGATGGTTCTTAGAAAGAAACTTTTTGTCGTGCCCTAGCTAAAG 

Hhal 

CjePl| CjePI 
TaqI I I Cviai Mwol | 

I II III 

TTTTGAAGGGAATAAATCGAGCGCAACAGGGAAAAAAGGTGGGGCTATTTGTGCTACTGG 

661 + + + + + + 720 

AAAACTTCCCTTATTTAGCTCGCGTTGTCCCTTTTTTCCACCCCGATAAACACGATGACC 



Taal 
Sf cl I 
BsrI I I 
Rsal I I 
I II 



MboII 
NlalVl 
CviJI I I 
III 



BspMI 
Sspl 
Bbvl 
Mnll 



Earl 
Hpyl78III 
TaqI 
Bcefl I 
I I 



Fnu4HI 
Sf cl 
Alul 
CviJI 
Mwol 
Tsel 
I 



TACTGTAGATATTACAAATAATACGGCTCCTACCCTCTTCTCGAACAATATTGCTGAAGC 
ATGACATCTATAATGTTTATTATGCCGAGGATGGGAGAAGAGCTTGTTATAACGACTTCG 



780 



EC057I 
Alul 
CviJI 
Mwol 
Aarl I 
Pstl| I 
CviRI I I I 
I II I 



781 



Tsp509I 
Rsal 
BsrGI I 

Taal I I BsmAI 
TatI I I Maell BsmBI 

III II 
TGCAGGTGGAGCTATAAATAGCACAGGAAACTGTACAATTACAGGGAATACGTCTCTTGT 

ACGTCCACCTCGATATTTATCGTGTCCTTTGACATGTTAATGTCCCTTATGCAGAGAACA 



BseRI 
Hpyl78III 
Banll 
BsiHKAI 
Bspl286I 
Sad 
Sf aNI 

Maelll Acil Alul | 

Hpyl88IX Tsp45I BsiEI Mnll CviJI | | jCjel 

I I II II 

ATTTTCTGAAAATAGTGTGACAGCGACCGCAGGAAATGGAGGAGCTCTTTCTGGAGATGC 

841 + + + + + + 900 

TAAAAGACTTTTATCACACTGTCGCTGGCGTCCTTTACCTCCTCGAGAAAGACCTCTACG 
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Maelll 



Maelll 
Hpyl8 8ix 
Hinfl I 
BslI I I 
Bpml |TfiI I 
I I I I 



Hpyl78III 
Cjel Ddel I 
I I I 



BseMII 
Sfcl 
Alul I Alul 
Cvi JI I Cvi JI 
II I 



CGATGTTACCATATCTGGGAATCAGAGTGTAACTTTCTCAGGAAACCAAGCTGTAGCTAA 

901 + + + + + + 

GCTACAATGGTATAGACCCTTAGTCTCACATTGAAAGAGTCCTTTGGTTCGACATCGATT 



960 



Neil 
ScrFI 
BsaJI 
Mspl 
Bsri 
TspRI 
CviJI 



CviJI 
Nlaiv| 
Ecil I I 
Acill II 
II 11 



Sthl32I 
Alul 
CviJI 
Hindlll I 
Ddel I 



Mwol 



BslI 



Hinfl 
Tf il 



TGGCGGAGCCATTTATGCTAAGAAGCTTACACTGGCTTCCGGGGGGGGGGGGGGGAATCC 
961 + + + + + + 1020 

ACCGCCTCGGTAAATACGATTCTTCGAATGTGACCGAAGGCCCCCCCCCCCCCCCTTAGG 



PstI 
TspRI 
CviRI 
HgiEII 
BtsI 
Sfcl 
Kpnl 
NlalV 



Rsal 
BanI 
BspMI 
BsaJI I 
Styl I 
Cjell I 

II I 



CviJI 
Nlaiv| 

I I 



Xcml 
Cjel I 
I I 



CTTTTCTAACAATATAGTCCAAGGTACCACTGCAGGTAATGGTGGAGCCATTTCTATACT 

1021 + + + + + + 1080 

GAAAAGATTGTTATATCAGGTTCCATGGTGACGTCCATTACCACCTCGGTAAAGATATGA 
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Alul 
CviJI 
MspAlI 
PvuII 
Bsri I 
Fnu4HI I 
Tsel| I 
II I 



Bbvl 



Bpml 
I 



Nlaiv 
Avail I 
Sau96I I 
I I 



BsmFI 
Msel I 
Xcml I I 

II I 



GGCAGCTGGAGAGTGTAGTCTTTTCAGCGAAGCAGGGGACCATTACCTTAATGGGAATGC 

1081 + + + + + + 1140 

CCGTCGACCTCTCACATCAGAAAAGTCGCTTCGTCCCCTGGTAATGGAATTACCCTTACG 

Dpnl 
BstYI 

CviRI Sau3AI 
Mwol I Apol HaelV I I Bcgl 

BsmI I I Tsp509I Hin4I j | Alwl | 

III I I II II 

CATTGTTGCAACTACACCACAAACTACAAAAAGAAATTCTATTGACATAGGATCTACTGG 

1141 + + + + + + 1200 

GTAACAACGTTGATGTGGTGTTTGATGTTTTTCTTTAAGATAACTGTATCCTAGATGACC 



Tsp509I 
Hpyl78III 
Dpnl I 
Sau3AI I I 
BsrI I I I 

I II I 



CviRI 
BsaAI I 
Maell| I 

II I 



Bcgl 



HpylSBIX 
Dpnl 
Sau3AI 
Pf 111081 I 
SfaNI I I 
Alwl I I I 

II II 



Bbvl 



CAAAGATCACGAATTACGTGCAATATCTGGGCATAGCATCTTTTTCTACGATCCGATTAC 

1201 + + + + + + 1260 

GTTTCTAGTGCTTAATGCACGTTATAGACCCGTATCGTAGAAAAAGATGCTAGGCTAATG 



Hinf I 
Tf il 
Acil 
Fnu4HI I 

CviJI I I I CviJI 
Tselj I I Beef I Dral BspMI | 

Mwol II I jsfcl I Msel I SfaNI | | CviRI 

I II I I I I II III I 

TGCTAATACGGCTGCGGATTCTACAGATACTTTAAATCTCAATAAGGCTGATGCAGGTAA 

1261 + + + + + + 1320 

ACGATTATGCCGACGCCTAAGATGTCTATGAAATTTAGAGTTATTCCGACTACGTCCATT 



Hpyl8 8IX 
HphI I 

Rsal TaqI Alul | | 

TatI I SimI I CviJI | | 

II II III 

TAGTACAGATTATAGTGGGTCGATTGTTTTTTCTGGTGAAAAGCTCTCTGAAGATGAAGC 

1321 + + + + + + 1380 

ATCATGTCTAATATCACCCAGCTAACAAAAAAGACCACTTTTCGAGAGACTTCTACTTCG 
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EC057I 
CviRI 
BstAPI I 
Mwol I 
MboII I I 

I I I 



Maelll 
CviJI 
Fnu4HI I 
Mwol I I 
Mnll Tselj | 

I II I 

AAAAGTTGCAGACAACCTCACTTCTACGCTGAAGCAGCCTGTAACTCTAACTGCAGGAAA 

1381 + + + + + + 1440 

TTTTCAACGTCTGTTGGAGTGAAGATGCGACTTCGTCGGACATTGAGATTGACGTCCTTT 



Apol 
Tsp509I 
PstI 
CviRI 
Sf cl I 
Eco57l| I 
Bbvl I I I 

I II I 



BseMII 
NlalV 

SacII 

Acil 



Mae 1 1 
Msel 
Rsal I 
Seal I 
TatI I I 
I I I 



MspAlI 
Thai 
Acil 
BsaJI 
BstDSI 
Hpyl8 8IX 
Faul 



Hpyl78III Ddel| 
iMaelll I Sthl32l| 

|Tsp45I Taqi CviJI || 

II I II 

TTTAGTACTTAAACGTGGTGTCACTCTCGATACGAAAGGCTTTACTCAGACCGCGGGTTC 

1441 + + + + + + 1500 

AAATCATGAATTTGCACCACAGTGAGAGCTATGCTTTCCGAAATGAGTCTGGCGCCCAAG 



FauI 
Mnll 
Sthl32I I 
SfaNI I I 

I II 



Msel 
Acll 
Fokl 
Maell 
Bspl286I 
Bmgl 
BseSI 
CacSI I 
Acil I I 
I I I 



Mnll 
Rsal 
TatI I 
Mnll I I 

M I 



Maelll 
Tsp45I 



BseRI 
Msel I 



I 



CTCTGTTATTATGGATGCGGGCACAACGTTAAAAGCAAGTACAGAGGAGGTCACTTTAAC 

1501 + + + + + + 1560 

GAGACAATAATACCTACGCCCGTGTTGCAATTTTCGTTCATGTCTCCTCCAGTGAAATTG 



Hinf I 
Acci I 

Sfcl I I Fnu4HI Sfcl 

Plel I I I Mnll Bbvl Tsp509I Tsel | Mwol | 

II II I I I II II 

AGGTCTTTCCATTCCTGTAGACTCTTTAGGCGAGGGTAAGAAAGTTGTAATTGCTGCTTC 

1561 + + + + + + 

TCCAGAAAGGTAAGGACATCTGAGAAATCCGCTCCCATTCTTTCAACATTAACGACGAAG 



1620 



Fig. 4 G 
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Hinf I 

PstI Tfil 

Fnu4Hl| Hpyl88IX 

CviRI I I Avail I 

Mwoljj Ddel MboII | | BsaJI 

Tseljj Bbvl CviJI |Sau96I | | Styl 

III I I I I I 

TGCAGCAAGTAAAAATGTAGCCCTTAGTGGTCCGATTCTTCTTTTGGATAACCAAGGGAA 

1621 + + + + + + 1680 

ACGTCGTTCATTTTTACATCGGGAATCACCAGGCTAAGAAGAAAACCTATTGGTTCCCTT 



1681 



Alul 
CviJI 
Fnu4HI I 

Hpyl78III CviRI I j 

Smll I Tsel I i 

II III 
TGCTTATGAAAATCACGACTTAGGAAAAACTCAAGACTTTTCATTTGTGCAGCTCTCTGC 

ACGAATACTTTTAGTGCTGAATCCTTTTTGAGTTCTGAAAAGTAAACACGTCGAGAGACG 



BsmI 



Ddel 
Bce83I I 
HpylVBIII I 

I I 



1740 



Sf Cl 

CviRI NlalV I 

Acelll Bsgl I Acil | | Cjel 

Bbvl I Rsal| I Sfcl MspAlI | |TaaI | 

II II I I I I I I I 

TCTGGGTACTGCAACAACTACAGATGTTCCAGCGGTTCCTACAGTAGCAACTCCTACGCA 

1741 + + + + + + 1800 

AGACCCATGACGTTGTTGATGTCTACAAGGTCGCCAAGGATGTCATCGTTGAGGATGCGT 

Cjel Cac8I 
Rsal I Acil I 

II II 
CTATGGGTATCAAGGTACTTGGGGAATGACTTGGGTTGATGATACCGCAAGCACTCCAAA 

1801 + + + + + + 1860 

GATACCCATAGTTCCATGAACCCCTTACTGAACCCAACTACTATGGCGTTCGTGAGGTTT 



Tthlllll 
Ddel I 
I I 



1861 



Avail 
Sau96I 
Alul I 
CviJI I 
Mwol I I 
I I I 



BpulOI 
Ddel 
Hpyl78III 
Hinfl 
Tfil 
Hgal 



Hpyl8 8IX 
BseMII I 
CviJI I I 

I I I 

GACTAAGACAGCGACATTAGCTTGGACCAATACAGGCTACCTTCCGAATCCTGAGCGTCA 
CTGATTCTGTCGCTGTAATCGAACCTGGTTATGTCCGATGGAAGGCTTAGGACTCGCAGT 
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Avail 

EcoOlOSI Dpnl 

PspSII BstYI I 

Sau96I Fokl |Hpyl88IX Hinfl 

Sse8647I Alol CviJI Sau3AI | Alwl | Tfil 

II I I I I I I 

AGGACCTTTAGTTCCTAATAGCCTTTGGGGATCTTTTTCAGACATCCAAGCGATTCAAGG 

1921 + + + + + + 1980 

TCCTGGAAATCAAGGATTATCGGAAACCCCTAGAAAAAGTCTGTAGGTTCGCTAAGTTCC 




Mwol 
Hinfl 
Acil 
Fnu4HI 
CviJI 



Bbvl 
TaqI 
Dpnl 
Sau3AI I 
Hpyl8 8IX| I 
Plel Hinfl Mnll | | | 

II ill! 
TGTCATAGAGAGAAGTGCTTTGACTCTTTGTTCAGATCGAGGCTTCTGGGCTGCGGGAGT 

1981 + + + + + + 2040 

ACAGTATCTCTCTTCACGAAACTGAGAAACAAGTCTAGCTCCGAAGACCCGACGCCCTCA 



Faul I 
Sthl32l| I 
CviJI I I Tsel 
III I 



Plel 

Tsp509l| Ddel Taal 

Ml I 

CGCCAATTTCTTAGATAAAGATAAGAAAGGGGAAAAACGCAAATACCGTCATAAATCTGG 

2041 + + + + + + 2100 

GCGGTTAAAGAATCTATTTCTATTCTTTCCCCTTTTTGCGTTTATGGCAGTATTTAGACC 



Hpyl8 8IX 
Mnll I 
Haeiv| I 
Hin4l| I 
II I 



Tsp509I 

Fnu4HI Bsgl I 

CviRl| Hpyl88IX | | Alul 

Tsel|HhaI Bbvl | | Msel | CviJI 

Ml I I I II I 

TGGATATGCTATCGGAGGTGCAGCGCAAACTTGTTCTGAAAACTTAATTAGCTTTGCCTT 

2101 + + + + + + 2160 

ACCTATACGATAGCCTCCACGTCGCGTTTGAACAAGACTTTTGAATTAATCGAAACGGAA 



BstXI 



Ddel 



TTGCCAACTCTTTGGTAGCGATAAAGATTTCTTAGTCGCTAAAAATCATACTGATACCTA 
2161 + + + + + + 2220 

AACGGTTGAGAAACCATCGCTATTTCTAAAGAATCAGCGATTTTTAGTATGACTATGGAT 



CviJI 

NlaIV| BsmAI 
CviRI II Bsbl Mmel Ddel | 

I II I I II 

TGCAGGAGCCTTCTATATCCAACACATTACAGAATGTAGTGGGTTCATAGGTTGTCTCTT 

2221 + + + + + + 2280 

ACGTCCTCGGAAGATATAGGTTGTGTAATGTCTTACATCACCCAAGTATCCAACAGAGAA 
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CviJI 
ScrFI I 
EcoRII I I 
I I I 



Hinf I 



Plel 



Cac8I 
Alul 
CviJI 
Fnu4HI I 
Mnll Tsel| | 
I II I 



AGATAAACTTCCTGGCTCTTGGAGTCATAAACCCCTCGTTTTAGAAGGGCAGCTCGCTTA 

2281 + + + + + + 2340 

TCTATTTGAAGGACCGAGAACCTCAGTATTTGGGGAGCAAAATCTTCCCGTCGAGCGAAT 



Mae 1 1 
CviJI I 
Acellll j 
Bbvl I I 1 

III I 



BSU36I 
Ddel 
Hpyl78III 
EC057I 
BseMII 
MboII 



Hpyl88IX BstZ17l| 
Dpnl I AccI I I 

SauBAI I 1 Bbsl {|| | Mnll | | BciVI 

III I III 

TAGCCACGTCAGTAATGATCTGAAGACAAAGTATACTGCGTATCCTGAGGTGAAAGGTTC 

2341 + + + + + + 2400 

ATCGGTGCAGTCATTACTAGACTTCTGTTTCATATGACGCATAGGACTCCACTTTCCAAG 



MboII Alul 

HphI Msel Nlalll I CviJI Hpyl78III 

I I I I I I 

TTGGGGGAATAATGCTTTTAACATGATGTTGGGAGCTTCTTCTCATTCTTATCCTGAATA 

2401 + + + + + + 2460 

AACCCCCTTATTACGAAAATTGTACTACAACCCTCGAAGAAGAGTAAGAATAGGACTTAT 

HpylBBIX 

BspMI Hinfl I Hpyl78III 

CviRI I BsaXI Tfil | Maell | 

II I II II 

CCTGCATTGTTTTGATACCTATGCTCCATACATCAAACTGAATCTGACCTATATACGTCA 

2461 + + + + + + 2520 

GGACGTAACAAAACTATGGATACGAGGTATGTAGTTTGACTTAGACTGGATATATGCAGT 

Hpyl88IX Dpnl 
Alul I Bglll I 

CviJI I BstYI j MboII Earl 

Hin4I I Rsal SauBAI | MboII | Tsp509l| 

II I I I I I II 

GGACAGCTTCTCGGAGAAAGGTACAGAAGGAAGATCTTTTGATGACAGCAACCTCTTCAA 

2521 + + + + + + 2580 

CCTGTCGAAGAGCCTCTTTCCATGTCTTCCTTCTAGAAAACTACTGTCGTTGGAGAAGTT 



HpylSeiX 
Hin4I I 

Mnll Sfcl HphI I I BplI 

II III I 

TTTATCTTTGCCTATAGGGGTGAAGTTTGAGAAGTTCTCTGATTGTAATGACTTTTCTTA 

2581 + + + + + + 2640 

AAATAGAAACGGATATCCCCACTTCAAACTCTTCAAGAGACTAACATTACTGAAAAGAAT 
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Hpyl8 8IX 
Dpnl I 
Sau3AI I I 
I I I 



Dpnl 
Sau3AI I 
Hpyl78IIl| I 
II I 



BsrDI 
Dpnl 
Sau3AI I i 
Alwl I I I Sfcl 
Acil I III CviRI I 

II I I 

TGATCTGACTTTATCCTATGTTCCTGATCTTATCCGCAATGATCCCAAATGCACTACAGC 

2641 + + + + + + 2700 

ACTAGACTGAAATAGGATACAAGGACTAGAATAGGCGTTACTAGGGTTTACGTGATGTCG 

CviJI 

CviJI Hael 
Nlaiv| Haelll 
Acil I I StuI 
MspAlI I I Mnll Ddel Mwol | 

I II I I II 

ACTTGTAATCAGCGGAGCCTCTTGGGAAACTTATGCCAATAACTTAGCACGACAGGCCTT 

2701 + + + + + + 2760 

TGAACATTAGTCGCCTCGGAGAACCCTTTGAATACGGTTATTGAATCGTGCTGTCCGGAA 



CviRI 



Maelll 
Tsp4 5I 
CacSI I 
CviRI I I 
CacBI I I I 

III I 



Bsgl 
Tthlllll I 



BsrI 
CviJI 
Haelll 
Eael 
Gdill 
BsiHKAI I 
Bspl286I I 

II 



GCAAGTGCGTGCAGGCAGTCACTACGCCTTCTCTCCTATGTTTGAAGTGCTCGGCCAGTT 

2761 + + + + + + 2820 

CGTTCACGCACGTCCGTCAGTGATGCGGAAGAGAGGATACAAACTTCACGAGCCGGTCAA 



Dpnl 
NlalV 
BamHI 
BstYI 
SauSAI 
Alwl I 
I I 



Dpnl 
Bglll I 
BstYI I 
SauSAI I 



Hgal 
Tsp509I 



Alwl Mnll 

II II 
TGTCTTTGAAGTTCGTGGATCCTCACGGATTTATAATGTAGATCTTGGGGGTAAGTTCCA 

2821 + + + + + + 2880 

ACAGAAACTTCAAGCACCTAGGAGTGCCTAAATATTACATCTAGAACCCCCATTCAAGGT 
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Dpnl 
Sau3AI 
BseMII 
Ddel 
Hpyl8 8IX 
BplI 



Bfal 
BslI 



Apol 
Tsp509I 
BsmAI 
HpylSSIX 
BseMII 
Ddel 
Nlalll 
BsmAI I 
BsmBI 



Hin4I 



Msel 



I III 
ATTCTAGGAGCGTCTCTCATGTCTCAGAAATTCTGAGAGAGATCGCATTTAGGATTTTCT 

2881 + + + + + + 2940 

TAAGATCCTCGCAGAGAGTACAGAGTCTTTAAGACTCTCTCTAGCGTAAATCCTAAAAGA 



TAAACACGAC 

2941 + 2950 

ATTTGTGCTG 
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Nucleotide and deduced amino acid sequence of CPN100638 



tgtagattct taacttacta gtctctcctt tcctcttgct ttctttaatt tattgcagta 60 

tgtggtgaaa taatttgtta aaccacctat agccctctac atg aaa tec tct ctt 115 

Met Lys Ser Ser Leu 
1 5 

cat tgg ttt tta ate teg tea tct tta gca ctt cec ttg tea eta aat 163 
His Trp Phe Leu lie Ser Ser Ser Leu Ala Leu Pro Leu Ser Leu Asn 
10 15 20 

ttc tct gcg ttt get get gtt gtt gaa ate aat eta gga ect acc aat 211 
Phe Ser Ala Phe Ala Ala Val Val Glu lie Asn Leu Gly Pro Thr Asn 
25 30 35 

age ttc tct gga cea gga ace tac act ect cca gee caa aca aea aat 259 
Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro Ala Gin Thr Thr Asn 
40 45 50 

gca gat gga act ate tat aat eta aca ggg gat gtc tea ate acc aat 3 07 
Ala Asp Gly Thr lie Tyr Asn Leu Thr Gly Asp Val Ser lie Thr Asn 
55 60 65 

gca gga tct ccg aca get eta acc get tec tgc ttt aaa gaa act act 3 55 
Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys Phe Lys Glu Thr Thr 
70 75 80 85 

ggg aat ctt tct ttc eaa gge eac gge tac caa ttt etc eta caa aat 4 03 
Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin Phe Leu Leu Gin Asn 
90 95 100 

ate gat gcg gga gcg aac tgt acc ttt ace aat aea get gca aat aag 451 
lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn Thr Ala Ala Asn Lys 
105 110 115 

ett etc tec ttt tea gga ttc tee tat ttg tea eta ata eaa acc aeg 499 
Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser Leu lie Gin Thr Thr 
120 125 130 

aat get acc aca gga aca gga gee ate aag tec aea gga get tgt tct 547 
Asn Ala Thr Thr Gly Thr Gly Ala lie Lys Ser Thr Gly Ala Cys Ser 
135 140 145 

att eag teg aac tat agt tgc tac ttt gge caa aac ttt tct aat gae 5 95 
lie Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin Asn Phe Ser Asn Asp 
150 155 160 165 

aat gga gge gee etc eaa gge age tct ate agt eta teg eta aac ccc 643 
Asn Gly Gly Ala Leu Gin Gly Ser Ser lie Ser Leu Ser Leu Asn Pro 
170 175 180 



aac eta aeg ttt gee aaa aae aaa gca aeg caa aaa ggg ggt gee etc 
Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin Lys Gly Gly Ala Leu 
185 190 195 



691 
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tat tec acg gga ggg att aca att aac aat acg tta aac tea gca tea 739 
Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr Leu Asn Ser Ala Ser 
200 205 210 

ttt tet gaa aat ace gcg gcg aac aat ggc gga gee att tac acg gaa 787 
Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly Ala lie Tyr Thr Glu 
215 220 225 

get age agt ttt att age age aae aaa gea att age ttt ata aac aat 835 
Ala Ser Ser Phe lie Ser Ser Asn Lys Ala lie Ser Phe lie Asn Asn 
230 235 240 245 

agt gtg acc gca acc tea get aca ggg gga gee att tac tgt agt agt 883 
Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala lie Tyr Cys Ser Ser 
250 255 260 

aca tea gee ccc aaa cea gte tta act eta tea gae aae ggg gaa etg 931 
Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser Asp Asn Gly Glu Leu 
265 270 275 

aac ttt ata gga aat aca gca att act agt ggt ggg gcg att tat act 979 
Asn Phe lie Gly Asn Thr Ala lie Thr Ser Gly Gly Ala lie Tyr Thr 
280 285 290 

gae aat eta gtt ctt tet tct gga gga ect acg ett ttt aaa aac aae 1027 
Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr Leu Phe Lys Asn Asn 
295 300 305 

tct ggc tat gat act gca get ccc tta gga gga gca att gcg att get 1075 
Ser Gly Tyr Asp Thr Ala Ala Pro Leu Gly Gly Ala lie Ala lie Ala 
310 315 320 325 

gae tct gga tct ttg agt ctt teg get ctt ggt gga gae ate act ttt 1123 
Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly Gly Asp lie Thr Phe 
330 335 340 

gaa gga aac aca gta gtc aaa gga get tet teg agt cag acc act acc 1171 
Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser Ser Gin Thr Thr Thr 
345 350 355 

aga aat tct att aac ate gga aac ace aat get aag att gta cag etg 1219 
Arg Asn Ser lie Asn lie Gly Asn Thr Asn Ala Lys lie Val Gin Leu 
360 365 370 

ega gee tet eaa ggc aat act ate tac ttc tat gat ect ata aca act 1267 
Arg Ala Ser Gin Gly Asn Thr lie Tyr Phe Tyr Asp Pro lie Thr Thr 
375 380 385 

age ate act gca get etc tea gat get eta aae tta aat ggt ect gae 1315 
Ser lie Thr Ala Ala Leu Ser Asp Ala Leu Asn Leu Asn Gly Pro Asp 
390 395 - 400 405 



Ctt gca ggg aat ect gea tat caa gga acc ate gta ttt 
Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr lie Val Phe 
410 415 



tct gga gag 
Ser Gly Glu 
420 



1363 
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aag etc teg gaa gea gaa get gea gaa get gat aat ete aaa tet aca 
Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp Asn Leu Lys Ser Thr 
425 430 435 



1411 



att cag eaa eet eta aet ctt geg gga ggg eaa ete tet ett aaa tea 
lie Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin Leu Ser Leu Lys Ser 
440 445 450 



1459 



gga gte act eta gtt get aag tee ttt teg eaa tet eeg gge tet ace 1507 
Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin Ser Pro Gly Ser Thr 
455 460 465 

etc etc atg gat gea ggg aee aca tta gaa ace get gat ggg ate act 1555 
Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr Ala Asp Gly lie Thr 
470 475 480 485 

ate aat aat ett gtt etc aat gta gat tec tta aaa gag ace aag aag 1603 
lie Asn Asn Leu Val Leu Asn Val Asp Ser Leu Lys Glu Thr Lys Lys 
490 495 500 

gge acg eta aaa gea aea eaa gea agt cag aca gte aet tta tet gga 1651 
Gly Thr Leu Lys Ala Thr Gin Ala Ser Gin Thr Val Thr Leu Ser Gly 
505 510 515 

teg etc tet ett gta gat cet tet gga aat gte tac gaa gat gte tet 1699 
Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val Tyr Glu Asp Val Ser 
520 525 530 

tgg aat aac cet caa gte ttt tet tgt etc aet ett aet get gac gae 1747 
Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr Leu Thr Ala Asp Asp 
535 540 545 

cce geg aat att eac ate aea gae tta get get gat eee eta gaa aaa 1795 
Pro Ala Asn lie His lie Thr Asp Leu Ala Ala Asp Pro Leu Glu Lys 
550 555 560 565 

aat cet ate eat tgg gga tae eaa ggg aat tgg gea tta tet tgg eaa 1843 
Asn Pro lie His Trp Gly Tyr Gin Gly Asn Trp Ala Leu Ser Trp Gin 
570 575 580 

gag gat aet geg aet aaa tec aaa gea geg act ctt ace tgg aea aaa 1891 
Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr Leu Thr Trp Thr Lys 
585 590 595 

aca gga tae aat eeg aat cet gag cgt cgt gga aee tta gtt get aac 1939 
Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly Thr Leu Val Ala Asn 
600 605 610 

aeg eta tgg gga tec ttt gtt gat gtg ege tee ata caa cag ctt gta 1987 
Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser lie Gin Gin Leu Val 
615 620 625 



gee aet aaa gta cgc eaa tet caa gaa aet ege gge ate tgg tgt gaa 
Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg Gly lie Trp Cys Glu 
630 635 640 645 



2035 
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ggg ate teg aac ttc ttc cat aaa gat age acg aag ata aat aaa ggt 2083 

Gly lie Ser Asn Phe Phe His Lys Asp Ser Thr Lys lie Asn Lys Gly 
650 655 660 

ttt cgc cac ata agt gca ggt tat gtt gta gga gcg act aca aca tta 2131 

Phe Arg His lie Ser Ala Gly Tyr Val Val Gly Ala Thr Thr Thr Leu 

665 670 675 

get tct gat aat ctt ate act gca gee ttc tgc caa tta ttc ggg aaa 2179 

Ala Ser Asp Asn Leu lie Thr Ala Ala Phe Cys Gin Leu Phe Gly Lys 
680 685 690 

gat aga gat cac ttt ata aat aaa aat aga get tct gee tat gca get 222 7 

Asp Arg Asp His Phe lie Asn Lys Asn Arg Ala Ser Ala Tyr Ala Ala 
695 700 705 

tct etc cat etc cag cat eta gcg ace ttg tct tct eca age ttg tta 2275 

Ser Leu His Leu Gin His Leu Ala Thr Leu Ser Ser Pro Ser Leu Leu 
710 715 720 725 

cgc tac ctt cet gga tct gaa agt gag cag cet gtc etc ttt gat get 2323 

Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro Val Leu Phe Asp Ala 
730 735 740 

cag ate age tat ate tat agt aaa aat act atg aaa ace tat tac ace 2371 

Gin lie Ser Tyr lie Tyr Ser Lys Asn Thr Met Lys Thr Tyr Tyr Thr 

745 750 755 

caa gca eca aag gga gag age teg tgg tat aat gae ggt tgc get etg 2419 

Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn Asp Gly Cys Ala Leu 
760 765 770 

gaa ctt gcg age tec eta eca cac act get tta age cat gag ggt etc 2467 

Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu Ser His Glu Gly Leu 
775 780 785 

ttc cac gcg tat ttt cet ttc ate aaa gta gaa get teg tae ata cac 2515 

Phe His Ala Tyr Phe Pro Phe lie Lys Val Glu Ala Ser Tyr lie His 
790 795 800 805 

caa gat age ttc aaa gaa cgt aat act ace ttg gta ega tct ttc gat 2563 

Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu Val Arg Ser Phe Asp 
810 815 820 

age ggt gat tta att aac gtc tct gtg cet att gga att acc ttc gag 2611 

Ser Gly Asp Leu lie Asn Val Ser Val Pro lie Gly lie Thr Phe Glu 

825 830 835 

aga ttc teg aga aac gag cgt gcg tct tac gaa get act gtc ate tac 2659 

Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu Ala Thr Val lie Tyr 
840 845 850 



gtt gee gat gtc tat cgt aag aat cet gae tgc acg aca get etc eta 2707 
Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys Thr Thr Ala Leu Leu 
855 860 865 



/ ^ 
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ate aac aat acc teg tgg aaa aet aca gga acg aat ete tea aga eaa 2755 
lie Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr Asn Leu Ser Arg Gin 
870 875 880 885 

get ggt ate gga aga gca ggg ate ttt tat gee tte tct eea aat ctt 2803 
Ala Gly lie Gly Arg Ala Gly lie Phe Tyr Ala Phe Ser Pro Asn Leu 
890 895 900 

gag gte aea agt aac eta tet atg gaa att cgt gga tet tea cge age 2851 
Glu Val Thr Ser Asn Leu Ser Met Glu lie Arg Gly Ser Ser Arg Ser 
905 910 915 

tae aat gca gat ctt gga ggt aag tte eag tte taaaagegtt cetgateeet 2 904 
Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
920 925 



tagaaattet aagagateet gagtgtatet agggacttct caaaga 



2950 
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Figure 6 A: Restriction enzyme analysis of CPNl 00638 

Tsp509I 

Hinfl Bfal Msel | 

Tfil Msel Spell BsmAI Mnll || CviRI 

I I II I I II I 

TGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGCAGTA 

ACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTCAT 

MboII 

Msel CviJI Mnll Mnll 

Tsp509I HphI I Sfcl I Nlallll Earl | 

III II II II 
TGTGGTGAAATAATTTGTTAAACCACCTATAGCCCTCTACATGAAATCCTCTCTTCATTG 
61 + + + + + + 

ACACCACTTTATTAAACAATTTGGTGGATATCGGGAGATGTACTTTAGGAGAGAAGTAAC 



Bbvl 
Apol I 
Tsp509I I Fnu4HI 
Maelll I I Mwol| 

Msel Tsp45I I I Tselj 

I I I I II 

GTTTTTAATCTCGTCATCTTTAGCACTTCCCTTGTCACTAAATTTCTCTGCGTTTGCTGC 

121 + + + + + + 180 

CAAAAATTAGAGCAGTAGAAATCGTGAAGGGAACAGTGATTTAAAGAGACGCAAACGACG 



NlalV 
BseRI 
Bpml 
ScrFI 
EcoRII 

Avail Avail 
ECO0109I BspGI 
PspSII Sau96I 
Sau96I Hpyl78IIl| | | | | BsaXI 

Sse8647I Alul II I I I|A1oI| 

Bfal I CviJI I i I I II Ppil I 

II I II I I II II 

TGTTGTTGAAATCAATCTAGGACCTACCAATAGCTTCTCTGGACCAGGAACCTACACTCC 

181 + + + + + + 240 

ACAACAACTTTAGTTAGATCCTGGATGGTTATCGAAGAGACCTGGTCCTTGGATGTGAGG 



Bed 

Mnll Tthllllll 
CviJI I CviRI I I HphI BsmAI 

II I II II 

TCCAGCCCAAACAACAAATGCAGATGGAACTATCTATAATCTAACAGGGGATGTCTCAAT 

241 + + + + + + 300 

AGGTCGGGTTTGTTGTTTACGTCTACCTTGATAGATATTAGATTGTCCCCTACAGAGTTA 



Fig. 6 B 
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Alul 
CviJI 
Alwl 
Hpyl8 8IX 
Dpnl 



Bsri 
Hinf I 
Tf il 
Bmrl I 



BstYI I II I Acelll Dral 

Sau3AI j II I Acil I Mselj 
Fokl CviRI II II I Mwol j | Mmel | | 

I I I I I I I III III II 

CACCAATGCAGGATCTCCGACAGCTCTAACCGCTTCCTGCTTTAAAGAAACTACTGGGAA 

301 + + + + + + 360 

GTGGTTACGTCCTAGAGGCTGTCGAGATTGGCGAAGGACGAAATTTCTTTGATGACCCTT 



BsaJI 
BstDSI 
CviJI 
Hael 
Haelll 
BsaJI I 
Styl I 
I I 



Tsp509I 
CviJI I 

I I 



Clal 
TaqI 
Faul 
Sthl32I I 
Sf aNI I I 

Beef I I 



Acil 
I 



TCTTTCTTTCCAAGGCCACGGCTACCAATTTCTCCTACAAAATATCGATGCGGGAGCGAA 

361 + + + + + + 420 

AGAAAGAAAGGTTCCGGTGCCGATGGTTAAAGAGGATGTTTTATAGCTACGCCCTCGCTT 



Bbvl 
Rsal I 
Taal I I 
I I I 



Alul 
CviJI 
Hindlll 
CviRI 
Fnu4HI 

Alul 
CviJI 
MspAlI 
PvuII 
Tsel 



Hinf I 
Tf il 
Hpyl78III I 
I I 



Maelll 
Tsp45I 



CTGTACCTTTACCAATACAGCTGCAAATAAGCTTCTCTCCTTTTCAGGATTCTCCTATTT 

421 + + + + + + 480 

GACATGGAAATGGTTATGTCGACGTTTATTCGAAGAGAGGAAAAGTCCTAAGAGGATAAA 



Bed 

CviJI I Alul 
Bsmi NlalVl I CviJI 

I III I 

GTCACTAATACAAACCACGAATGCTACCACAGGAACAGGAGCCATCAAGTCCACAGGAGC 

CAGTGATTATGTTTGGTGCTTACGATGGTGTCCTTGTCCTCGGTAGTTCAGGTGTCCTCG 



540 
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541 



CviJI 

Hael 
Haelll 

MscI 
Eael I 
Mwol I I 

Taqi Sfcl CjePI | | j Mnll 

II I II I I 

TTGTTCTATTCAGTCGAACTATAGTTGCTACTTTGGCCAAAACTTTTCTAATGACAATGG 

AACAAGATAAGTCAGCTTGATATCAACGATGAAACCGGTTTTGAAAAGATTACTGTTACC 



600 



BsaJI 
Haell 
Hhal 



Hin4I 
NlalV 
BsaHI 
Narl 
Bani I 
CjePI I 
I I 



Alul 
CviJI 
Mnll I 
Fnu4HI I I 
Styl Tsel I I I 
I MM 



Acelll 
Bbvl I 
BplI I I 

I I I 



CjePI 
I 



Acll 
Mae 1 1 
I 



AGGCGCCCTCCAAGGCAGCTCTATCAGTCTATCGCTAAACCCCAACCTAACGTTTGCCAA 
TCCGCGGGAGGTTCCGTCGAGATAGTCAGATAGCGATTTGGGGTTGGATTGCAAACGGTT 



660 



Sthl32I 
Bspl286I 
Bmgl 
BseSI 
NlaIV| 
BanI I I 
I II 



BslI 
BscGI 
Mnll 
BsaJI I 
BstDSI I 
Mnll I I 

II I 



Msel 
Tsp509I I 

I I 



AAACAAAGCAACGCAAAAAGGGGGTGCCCTCTATTCCACGGGAGGGATTACAATTAACAA 

661 + + + + + + 720 

TTTGTTTCGTTGCGTTTTTCCCCCACGGGAGATAAGGTGCCCTCCCTAATGTTAATTGTT 



Fnu4HI 
SacII 
Taul 
Acil I 
MspAlI I 

Thai I CviJI 
BseMII Acil I I NlaIV| 

Hpyl88IX BsaJI j | Ecil | | 

Ddel SfaNl| BstDSI jj Acil| || 

I II III II II 

TACGTTAAACTCAGCATCATTTTCTGAAAATACCGCGGCGAACAATGGCGGAGCCATTTA 

ATGCAATTTGAGTCGTAGTAAAAGACTTTTATGGCGCCGCTTGTTACCGCCTCGGTAAAT 



Msel 
Maell I 

I I 



721 



780 




^ Fig. 6 D 
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J 



CacSI 
Bfal 
Alul I 

Cvi JI I I Fnu4HI Bbvl Alul Maelll 

Nheljj Tsel| Tsp509I CviJI Tsp45I 

III II I I I 

CACGGAAGCTAGCAGTTTTATTAGCAGCAACAAAGCAATTAGCTTTATAAACAATAGTGT 

781 + + + + + + 840 

GTGCCTTCGATCGTCAAAATAATCGTCGTTGTTTCGTTAATCGAAATATTTGTTATCACA 



CviJI 
NlalV 
BseMII 
Mnll 
Cjel 
Sf cl 
Alul 
CviJI 
Mwol 
BbvCI I 

BpulOlj II II III Taal Rsal Cjel 

Acil Ddelj II II III Sfclj TatI | CviJI | BsrI 

I II II II III II I I II I 

GACCGCAACCTCAGCTACAGGGGGAGCCATTTACTGTAGTAGTACATCAGCCCCCAAACC 

841 + + + + + + 

CTGGCGTTGGAGTCGATGTCCCCCTCGGTAAATGACATCATCATGTAGTCGGGGGTTTGG 



900 



Hpyl88IX Tsp509I Bfal 

Msel Sthl32I I BscGI Cjel | Spel | 

I I I I I I II 

AGTCTTAACTCTATCAGACAACGGGGAACTGAACTTTATAGGAAATACAGCAATTACTAG 

901 + + + + + + 960 

TCAGAATTGAGATAGTCTGTTGCCCCTTGACTTGAAATATCCTTTATGTCGTTAATGATC 



Cjel 



MboII 
Bfal I 
II 



Avail 
EC00109I 
PspSII 
Sau96I 
Sse8647I 
Hpyl78III I 
Mnll I I 

I I I 



Bpml 
Dral I 
Msel I I 

II I 



TGGTGGGGCGATTTATACTGACAATCTAGTTCTTTCTTCTGGAGGACCTACGCTTTTTAA 

961 + + + + + + 1020 

ACCACCCCGCTAAATATGACTGTTAGATCAAGAAAGAAGACCTCCTGGATGCGAAAAATT 
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Muni 
Tsp509I 
Acelll 
BSU36I 
Ddel 
Mnll 



CviJI 
I 



Alul 
CviJI 
PstI 
BsaXI 
Fnu4HI 
CviRI I 
Tsel I 
Sfcl I I 

I II 



Bbvl 
BslI 



HaelV 
BseRI Hin4I 
Plel|HinfI I 
II II 



AAACAACTCTGGCTATGATACTGCAGCTCCCTTAGGAGGAGCAATTGCGATTGCTGACTC 

1021 + + + + + + 1080 

TTTGTTGAGACCGATACTATGACGTCGAGGGAATCCTCCTCGTTAACGCTAACGACTGAG 

Dpnl 

BstYI I BsmAI 

Sau3AI I Alwl CviJI | 

Hpyl78IIl| I Hinfl Plel | | Taal 

II I I I I I I 

TGGATCTTTGAGTCTTTCGGCTCTTGGTGGAGACATCACTTTTGAAGGAAACACAGTAGT 

1081 + + + + + + 1140 

ACCTAGAAACTCAGAAAGCCGAGAACCACCTCTGTAGTGAAAACTTCCTTTGTGTCATCA 



Hpyl8 8IX 
Msel I 



Alul Hpyl88IX 
CviJI Hinfl | Apol 
MboII I TaqI I jPlel Tsp509I 

II I I I I I 

CAAAGGAGCTTCTTCGAGTCAGACCACTACCAGAAATTCTATTAACATCGGAAACACCAA 

1141 + + + + + + 1200 

GTTTCCTCGAAGAAGCTCAGTCTGGTGATGGTCTTTAAGATAATTGTAGCCTTTGTGGTT 



CviJI 
Cac8I 
Fnu4HI 

Alul 
CviJI 
MspAlI 
PvuII 
Bce83I I 

Rsal I I II Dpnl 
Bbvl BsrGI I 11 II SauSAI | 

Ddel TatI |Tsel| j jsmll Mnll Alwl j | 

I I I II I I I I I II 

TGCTAAGATTGTACAGCTGCGAGCCTCTCAAGGCAATACTATCTACTTCTATGATCCTAT 

1201 + + + + + + 1260 

ACGATTCTAACATGTCGACGCTCGGAGAGTTCCGTTATGATAGATGAAGATACTAGGATA 
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Bfal 



Acelll 
Bbvl 
Hpyl8 8IX 
Alul 
CviJI 
PstI 



Sf aNI 
TspRI 
Fnu4HI 
Sf aNI 
CviRI 
Tsel 
BtsI I 
Sfcl 



Ddel 



Hpyl78III 
Msel Avail I 
BseMII |Sau96I j CviRI 

II II I 



AACAACTAGCATCACTGCAGCTCTCTCAGATGCTCTAAACTTAAATGGTCCTGACCTTGC 

1261 + + + + + + 1320 

TTGTTGATCGTAGTGACGTCGAGAGAGTCTACGAGATTTGAATTTACCAGGACTGGAACG 



HpylBBIX 
Bbvl 
Alul 

Bed CviJI 
Hinfl Drdll | Hin4l| 

Tfil CviRI NlaIV| j Hpyl78III Hin4I | j | | Bpml 

II III I I II 

AGGGAATCCTGCATATCAAGGAACCATCGTATTTTCTGGAGAGAAGCTCTCGGAAGCAGA 

1321 + + + + + + 1380 

TCCCTTAGGACGTATAGTTCCTTGGTAGCATAAAAGACCTCTCTTCGAGAGCCTTCGTCT 



Alul 
AlwNI 
CviJI 
Mwol 
PstI 
CviRI 
BplI 
Fnu4HI 
Sfcl 
Alul 
AlwNI 
BplI 
BstAPI 

CviJI III II I Acil 
Mwol III II I Faul Mnll | 

Tseljjl 11 I Tsp509I Sthl32 I | Mnll | | 

MM III I II II I 

AGCTGCAGAAGCTGATAATCTCAAATCTACAATTCAGCAACCTCTAACTCTTGCGGGAGG 

1381 + + + + + + 1440 

TCGACGTCTTCGACTATTAGAGTTTAGATGTTAAGTCGTTGGAGATTGAGAACGCCCTCC 



Fig. 6 G 
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Hin4I 
I 



Maelll 
Tsp45I 
Hinf I 
BplI I 
Hpyl78III I I 
Msel I I I 

I III 



Bfal 
Plel 



CviJI 
BseRI 
Neil I 
ScrFI I 
Mspl I I 
II I 



Ddel Sthl32I 
I I 

GCAACTCTCTCTTAAATCAGGAGTCACTCTAGTTGCTAAGTCCTTTTCGCAATCTCCGGG 

1441 + + + + + + 1500 

CGTTGAGAGAGAATTTAGTCCTCAGTGAGATCAACGATTCAGGAAAAGCGTTAGAGGCCC 



SfaNI 
Banll I 
Bspl286I I 
I I 



Fokl 
Cjel 
NlalV 
Avail 
Sau96I 
CviRI 
Mnll 
Mnll I 
Nlallll I 
BslI II I 
I II I 



Alwl 
BsaBI 
Dpnl 

MspAlI Sau3AI | 
Acil I Bed I I 
BsmFI I |Bsll| I I 
III II II 



CTCTACCCTCCTCATGGATGCAGGGACCACATTAGAAACCGCTGATGGGATCACTATCAA 

1501 + + + + + + 1560 

GAGATGGGAGGAGTACCTACGTCCCTGGTGTAATCTTTGGCGACTACCCTAGTGATAGTT 



Bsal 
Hinfl BsmAI 
Tfil Msel 



Bspl286I 
CacSI 
Bmgl I 

Cjel Tfil Msel BseSI j Bsbl 

I II II I 

TAATCTTGTTCTCAATGTAGATTCCTTAAAAGAGACCAAGAAGGGCACGCTAAAAGCAAC 

1561 + + + + + + 1620 

ATTAGAACAAGAGTTACATCTAAGGAATTTTCTCTGGTTCTTCCCGTGCGATTTTCGTTG 



Tthlllll 
Maelll 
Taal 
Tsp45I 
Hpyl8 8IX I 
Mwol Cjel I I 

I III 



Hpyl78III 
Dpnl 
BstYI 



Sau3AI 
Cjel I 
Alwl I I 
Alwl I I I 



Dpnl 
SauSAI I 
Hpyl78IIl| I 
II I 

ACAAGCAAGTCAGACAGTCACTTTATCTGGATCGCTCTCTCTTGTAGATCCTTCTGGAAA 

1621 + + + + + + 1680 

TGTTCGTTCAGTCTGTCAGTGAAATAGACCTAGCGAGAGAGAACATCTAGGAAGACCTTT 



Pf 111081 
Acci I 
CjePlI I 
I I I 



MboII 
BsmAI I 
Bce83I I I 
I II 



CjePI 
Smll I Mnll 

I I I 



BsmAI 



TGTCTACGAAGATGTCTCTTGGAATAACCCTCAAGTCTTTTCTTGTCTCACTCTTACTGC 

1681 + + + + + + 1740 

ACAGATGCTTCTACAGAGAACCTTATTGGGAGTTCAGAAAAGAACAGAGTGAGAATGACG 



Fig. 6 H 
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Faul 
sspi 

Sthl32I I 
Thai I I 
SimI Acil I I I 
I I I II 



Dpnl 
Sau3AI 
Fnu4HI 
Alul 



Alwl I 
Cvi JI I 
Tsel I 

Bbvl Ddel II II Bf al 

I I II I I I 

TGACGACCCCGCGAATATTCACATCACAGACTTAGCTGCTGATCCCCTAGAAAAAAATCC 

1741 + + + + + + 1800 

ACTGCTGGGGCGCTTATAAGTGTAGTGTCTGAATCGACGACTAGGGGATCTTTTTTTAGG 



Tsp509I 

BsaJI I BciVI 

BciVI Styl I Mnll | 

I II II 

TATCCATTGGGGATACCAAGGGAATTGGGCATTATCTTGGCAAGAGGATACTGCGACTAA 

1801 + + + + + + 1860 

ATAGGTAACCCCTATGGTTCCCTTAACCCGTAATAGAACCGTTCTCCTATGACGCTGATT 



BpulOI 
Ddel 
Hpyl78III 
Hinf I 
Tf il 
Hgal I 
HpylSBIX I 
BseMII I I 

I I I 



Hinf I 
Fnu4HI I 
Tsel I I 
Pleljl I 
III I 



BspGI 
ScrFI I 
Bbvl I I 
EcoRII I I BciVI 
III I 



ATCCAAAGCAGCGACTCTTACCTGGACAAAAACAGGATACAATCCGAATCCTGAGCGTCG 

1861 + + + + + + 1920 

TAGGTTTCGTCGCTGAGAATGGACCTGTTTTTGTCCTATGTTAGGCTTAGGACTCGCAGC 



Dpnl 
NlalV 
BamHI 

Cjel BstYI 
Ddel I Sau3AI 
NlalV I I Alwl I 

I II II 
TGGAACCTTAGTTGCTAACACGCTATGGGGATCCTTTGTTGATGTGCGCTCCATACAACA 

1921 + + + 4- + + 1980 

ACCTTGGAATCAACGATTGTGCGATACCCCTAGGAAACAACTACACGCGAGGTATGTTGT 



Hhal 
Hin4I I 
Alwl Cjel I I 

I III 



Alul Bce83I Hpyl78III 

CviJI CviJI I Rsal Smll | 

I I I I II II I 

GCTTGTAGCCACTAAAGTACGCCAATCTCAAGAAACTCGCGGCATCTGGTGTGAAGGGAT 

1981 + + + + + + 2040 

CGAACATCGGTGATTTCATGCGGTTAGAGTTCTTTGAGCGCCGTAGACCACACTTCCCTA 



Fnu4HI 
Taul 
Acil I 
Thai I 
I I 



Dpnl 
BstYI I 
Sau3AI I 
SfaNI I I 
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Fig- 6 I 



XmnI 
Alwl 
Hpyl78III I 
Taqi I 
MboIl| I 
II I 



2041 



MslI 

MboII BspMI |CviRI 

I III 
CTCGAACTTCTTCCATAAAGATAGCACGAAGATAAATAAAGGTTTTCGCCACATAAGTGC 
+ + + + + _,. 2100 

GAGCTTGAAGAAGGTATTTCTATCGTGCTTCTATTTATTTCCAAAAGCGGTGTATTCACG 



CviJI 
PstI I 
TspRI I 
Fnu4HI I 
CviRI I 

Hpyl8 8IX Tsell 
Alul I BtsI I 

Bsgl CviJI I Sfcl I 

I II II 

AGGTTATGTTGTAGGAGCGACTACAACATTAGCTTCTGATAATCTTATCACTGCAGCCTT 

2101 + + + + + + 2160 

TCCAATACAACATCCTCGCTGATGTTGTAATCGAAGACTATTAGAATAGTGACGTCGGAA 



Bbvl Dpnl 
Tsp5 09I Sau3AI I Alul 

Sthl32I |Hpyl78III BsaBI | | CviJI 

II I II I I 

CTGCCAATTATTCGGGAAAGATAGAGATCACTTTATAAATAAAAATAGAGCTTCTGCCTA 

2161 + + + + + + 2220 

GACGGTTAATAAGCCCTTTCTATCTCTAGTGAAATATTTATTTTTATCTCGAAGACGGAT 



Alul 
CviJI 
Fnu4HI 

Bpml 
CviRI 
Mwol 
Tsel 

I 



2221 



Tthllll 
SfaNI 
Bbsl I 

Bbvl MboII I 

Bed Bfal I I 

I III 

TGCAGCTTCTCTCCATCTCCAGCATCTAGCGACCTTGTCTTCTCCAAGCTTGTTACGCTA 
+ + + + + + 2280 

ACGTCGAAGAGAGGTAGAGGTCGTAGATCGCTGGAACAGAAGAGGTTCGAACAATGCGAT 



Mwol 
Maelll 
Alul I 
CviJI I 
Hindi I I I I 
I I I 



Hpyl8 8IX 
Dpnl 
BstYI 
Sau3AI 
ScrFI I 
EcoRII I I 
I I I 



Sfcl 
BseMII 
Alul 
CviJI 
Dpnl 



SfaNI SauSAI 

CviJI I Hpyl8 8IX| 

Fnu4HI I I Ddel I I 

Alwl Tsel I I I Bbvl Mnll | | | 

I II I I I II II 

CCTTCCTGGATCTGAAAGTGAGCAGCCTGTCCTCTTTGATGCTCAGATCAGCTATATCTA 

2281 + + + + + + 2340 

GGAAGGACCTAGACTTTCACTCGTCGGACAGGAGAAACTACGAGTCTAGTCGATATAGAT 
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Fig. 6 J 



2341 



Banll 
BsiHKAI 
Bspl286I 
Sad 
BssSI 
Alul 
CviJI 
Tthlllll I 
CjePI Taqll | j 



TAGTAAAAATACTATGAAAACCTATTACACCCAAGCACCAAAGGGAGAGAGCTCGTGGTA 

+ + + + + + 

ATCATTTTTATGATACTTTTGGATAATGTGGGTTCGTGGTTTCCCTCTCTCGAGCACCAT 



2400 



Hpyl78III 
Hhal I 
CjePI I I 
Taal I I I 
I I I I 



Banll 
BsiHKAI 
Bspl286I 
Sad 
Alul I 
CviJI I 
CacBI I I 
I I I 



Nlalll 
MboII 
CviJI 
Mnll 



Msel 
TspRI I 
BtsI I I 

I I I 



I 



TAATGACGGTTGCGCTCTGGAACTTGCGAGCTCCCTACCACACACTGCTTTAAGCCATGA 

2401 + + + + + + 2460 

ATTACTGCCAACGCGAGACCTTGAACGCTCGAGGGATGGTGTGTGACGAAATTCGGTACT 



Thai 
Af llll 
Mlul 
Earl I 
Bsal I I 
SimI BsmAI j j 

I III 



Alul 
CviJI 
Hindlll I 



Rsal 



I 



GGGTCTCTTCCACGCGTATTTTCCTTTCATCAAAGTAGAAGCTTCGTACATACACCAAGA 

2461 + + + + + + 2520 

CCCAGAGAAGGTGCGCATAAAAGGAAAGTAGTTTCATCTTCGAAGCATGTATGTGGTTCT 



Mae 1 1 
HphI 
Msel 

Dpnl Pad 
Alul BsaJI Sau3AI | Tsp509I | 

CviJI Maell Styl Rsal | j TaqI Acil Msel | j 

I I I I I I I I III 

TAGCTTCAAAGAACGTAATACTACCTTGGTACGATCTTTCGATAGCGGTGATTTAATTAA 

2521 + + + + + + 2580 

ATCGAAGTTTCTTGCATTATGATGGAACCATGCTAGAAAGCTATCGCCACTAAATTAATT 
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Hgal 
Hpyl78III 
Hpyl78III 
TaqI 
Aval I 
Smll I 
Hin4I I I 
Hinf I I I I I BplI 
BsmAI Hpyl78III |Xhol| j | CacSI | 

BsmBI Tsp509I TaqI |TfiI || | | Hin4l| 

I I I I I II I I II 

CGTCTCTGTGCCTATTGGAATTACCTTCGAGAGATTCTCGAGAAACGAGCGTGCGTCTTA 

2581 + + + + + + 2640 

GCAGAGACACGGATAACCTTAATGGAAGCTCTCTAAGAGCTCTTTGCTCGCACGCAGAAT 



Hpyl78III 

Alul Bsgl Hinfl | Alul 

CviJI Taal Maell BsaBI | Tfil | CviRI CviJI 

II I I I I I I I 

CGAAGCTACTGTCATCTACGTTGCCGATGTCTATCGTAAGAATCCTGACTGCACGACAGC 

2641 + + + + + + 2700 

GCTTCGATGACAGTAGATGCAACGGCTACAGATAGCATTCTTAGGACTGACGTGCTGTCG 



Acelll 



BSSSI 



Bce83I Hpyl78III 

Sfcl| Hinfl I Alul 

Mnll I I Tfil Smll | CviJI 

III III I 



TCTCCTAATCAACAATACCTCGTGGAAAACTACAGGAACGAATCTCTCAAGACAAGCTGG 

2701 + + + + + + 2760 

AGAGGATTAGTTGTTATGGAGCACCTTTTGATGTCCTTGCTTAGAGAGTTCTGTTCGACC 



MboII Maelll 

Dpnl I Tsp4 5I 

Hpyl88IX BstYI | | Hpyl78III | 

Earl I Sau3AI j | Smll | | 

Sapl I Hin4I I j I Alwl Mnll | | | Maelll 

II I I II I I II I I 

TATCGGAAGAGCAGGGATCTTTTATGCCTTCTCTCCAAATCTTGAGGTCACAAGTAACCT 

2761 + + + + + -f 2820 

ATAGCCTTCTCGTCCCTAGAAAATACGGAAGAGAGGTTTAGAACTCCAGTGTTCATTGGA 



Dpnl 
BstYI 
Sau3AI 
MboII 
Apol 



Tsp509I 
BslI I 
Bce83I I I 
I I I 



Alul 
CviJI 
Fnu4HI I 
Tsel| I 
Alwl I I I 

I II I 



Dpnl 
Mnll 
Bglll 
BstYI 
Sau3AI 
Bbvl I 
CviRI I I 

I I I 



Bsri 



ATCTATGGAAATTCGTGGATCTTCACGCAGCTACAATGCAGATCTTGGAGGTAAGTTCCA 

2821 + + + + + + 2880 

TAGATACCTTTAAGCACCTAGAAGTGCGTCGATGTTACGTCTAGAACCTCCATTCAAGGT 



50/77 



Fig. 6 L 



REPLACEMENT SHEET 
Title; CHLAMYDIA ANTIGENS AND 
CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/857,128 



Ddel 
Hpyl78III 
Dpnl 
BstYI 
Sau3AI 
Dpnl BseMII 
Sau3AI I Apol I 

Hpyl78IIl| I Tsp509I Alwl | 
Alwl II I Ddel I Ddel I | | | Bfal 

I II I I I II 

GTTCTAAAAGCGTTCCTGATCCCTTAGAAATTCTAAGAGATCCTGAGTGTATCTAGGGAC 

2881 + + + + + + 2940 

CAAGATTTTCGCAAGGACTAGGGAATCTTTAAGATTCTCTAGGACTCACATAGATCCCTG 



BsmFI 



TTCTCAAAGA 

2941 + 2950 

AAGAGTTTCT 
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gtacgaagtt cttcacgaaa ttataataca aacctaggct ctaagttttg tttctagatt 60 

atcgaaaacg tgttaattaa ttgaacccaa gcatctttct atg aaa ata ccc ttg 115 

Met Lys lie Pro Leu 
1 5 



cac aaa etc ctg ate tct teg act ctt gtc act ccc att eta ttg age 
His Lys Leu Leu lie Ser Ser Thr Leu Val Thr Pro lie Leu Leu Ser 
10 15 20 



163 



att gea act tac gga gca gat get tct tta tec ect aea gat age ttt 
lie Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser Pro Thr Asp Ser Phe 
25 30 35 



211 



gat gga gcg ggc ggc tct aca ttt act cca aaa tct aea gea gat gee 
Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys Ser Thr Ala Asp Ala 
40 45 50 



259 



aat gga aeg aac tat gtc tta tea gga aat gtc tat ata aac gat get 
Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val Tyr lie Asn Asp Ala 
55 60 65 



307 



ggg aaa ggc aca gca tta aea ggc tge tge ttt aca gaa act aeg ggt 
Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe Thr Glu Thr Thr Gly 
70 75 80 85 



355 



gat ctg aca ttt act gga aag gga tac tea ttt tea ttc aac aeg gta 
Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe Ser Phe Asn Thr Val 
90 95 100 



403 



gat gcg ggt teg aat gea gga get gcg gca age aca act get gat aaa 
Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser Thr Thr Ala Asp Lys 
105 110 115 



451 



gee eta ate ttc aca gga ttt tct aac ctt tec ttc att gea get cet 
Ala Leu lie Phe Thr Gly Phe Ser Asn Leu Ser Phe lie Ala Ala Pro 
120 125 130 



499 



gga act aca gtt get tea gga aaa agt act tta agt tct gca gga gee 
Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu Ser Ser Ala Gly Ala 
135 140 145 



547 



tta aat ctt ace gat aat gga aeg att etc ttt age caa aac gtc tec 
Leu Asn Leu Thr Asp Asn Gly Thr lie Leu Phe Ser Gin Asn Val Ser 
150 155 160 165 



595 



aat gaa get aat aac aat ggc gga gcg ate ace aea aaa act ctt tct 
Asn Glu Ala Asn Asn Asn Gly Gly Ala lie Thr Thr Lys Thr Leu Ser 
170 175 180 



643 



att tct ggg aat ace tct tct ata ace ttc act agt aat age gea aaa 
lie Ser Gly Asn Thr Ser Ser lie Thr Phe Thr Ser Asn Ser Ala Lys 
185 190 195 



691 



/ 
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aaa tta ggt 
Lys Leu Gly 
200 

aac acc ggc 
Asn Thr Gly 
215 

ggg get ctg 
Gly Ala Leu 
230 

ctt ttc ttc 
Leu Phe Phe 



gcc att tat 
Ala lie Tyr 



gga aat aaa 
Gly Asn Lys 
280 

gga gca ate 
Gly Ala lie 
295 

eta ttt tea 
Leu Phe Ser 
3.10 

get att gea 
Ala lie Ala 



gga gae ate 
Gly Asp lie 



aea teg aca 
Thr Ser Thr 
360 

aac tta agg 
Asn Leu Arg 
375 

gca tct aac 
Ala Ser Asn 
390 

gat age aac 
Asp Ser Asn 



gga geg ate 
Gly Ala lie 



cag tta gtc 
Gin Leu Val 



gge ttt gaa 
Gly Phe Glu 
235 

tet gga aae 
Ser Gly Asn 
250 

tgt gaa aaa 
Cys Glu Lys 
265 

agt ctg acc 
Ser Leu Thr 



tgt gcc cat 
Cys Ala His 



aat aat aga 
Asn Asn Arg 
315 

att gee gae 
lie Ala Asp 
330 

acg ttc ctt 
Thr Phe Leu 
345 

egg aat get 
Arg Asn Ala 



gca gee caa 
Ala Ala Gin 



acc aca gga 
Thr Thr Gly 
395 

teg cct tta 
Ser Pro Leu 
410 



tat age tct 
Tyr Ser Ser 
205 

ttt atg aat 
Phe Met Asn 
220 

gee age tee 
Ala Ser Ser 



act gca aea 
Thr Ala Thr 



aea gga gag 
Thr Gly Glu 
270 

ttc gcc gag 
Phe Ala Glu 
285 

ggt eta gat 
Gly Leu Asp 
300 

tge ggg aae 
Cys Gly Asn 



tet gga tet 
Ser Gly Ser 



ggc aac act 
Gly Asn Thr 
350 

ate tae ctg 
lie Tyr Leu 
365 

ggc caa tct 
Gly Gin Ser 
380 

get tea gae 
Ala Ser Asp 



gat tat tea 
Asp Tyr Ser 



geg get gea 
Ala Ala Ala 



aat aaa gga 
Asn Lys Gly 
225 

teg att act 
Ser lie Thr 
240 

gat get gea 
Asp Ala Ala 
255 

act cct act 
Thr Pro Thr 



aac tct tea 
Asn Ser Ser 



ctt tee get 
Leu Ser Ala 
305 

aea get gea 
Thr Ala Ala 
320 

tta agt etc 
Leu Ser Leu 
335 

eta acc tea 
Leu Thr Ser 



gga teg tea 
Gly Ser Ser 



ate tat ttc 
lie Tyr Phe 
385 

gtt ctg acc 
Val Leu Thr 
400 

gga acg att 
Gly Thr lie 
415 



agt att tea 
Ser lie Ser 
210 

gaa act ggg 
Glu Thr Gly 



caa aat age 
Gin Asn Ser 



ggc aag ggc 
Gly Lys Gly 
260 

ctt act ate 
Leu Thr lie 
275 

gta act caa 
Val Thr Gin 
290 

get gge cct 
Ala Gly Pro 



ggc aag ggc 
Gly Lys Gly 



tet gca aat 
Ser Ala Asn 
340 

ace tee geg 
Thr Ser Ala 
355 

gca aaa att 
Ala Lys lie 
370 

tat gat eeg 
Tyr Asp Pro 



ate aac caa 
lie Asn Gin 



gta ttt tct 
Val Phe Ser 
420 



gga 73 9 
Gly 



ggt 787 
Gly 



tec 835 

Ser 

245 

ggg 883 
Gly 



tct 931 
Ser 



ggc 97 9 
Gly 



acc 1027 
Thr 



gge 1075 

Gly 

325 

caa 1123 
Gin 



cea 1171 
Pro 



acg 1219 
Thr 



att 1267 
He 



ecg 1315 

Pro 

405 

ggg 13 63 
Gly 
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gaa aag etc tct gca gat gaa gcg aaa get get gat aae tte aea tet 
Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala Asp Asn Phe Thr Ser 
425 430 435 



1411 



ata tta aag caa cca ttg get eta gee tct gga ace tta gca etc aaa 
lie Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly Thr Leu Ala Leu Lys 
440 445 450 



1459 



gga aat gtc gag tta gat gtc aat ggt ttc aca cag act gaa ggc tct 
Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr Gin Thr Glu Gly Ser 
455 460 465 



1507 



aca etc etc atg caa cca gga aca aag etc aaa gca gat act gaa get 
Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys Ala Asp Thr Glu Ala 
470 475 480 485 



1555 



ate agt ett ace aaa ett gtc gtt gat ctt tet gee tta gag gga aat 
lie Ser Leu Thr Lys Leu Val Val Asp Leu Ser Ala Leu Glu Gly Asn 
490 495 500 



1603 



aag agt gtg tec att gaa aca gca gga gee aae aaa act ata act eta 
Lys Ser Val Ser lie Glu Thr Ala Gly Ala Asn Lys Thr lie Thr Leu 
505 510 515 



1651 



acc tct cet ctt gtt ttc caa gat agt age ggc aat ttt tat gaa age 
Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly Asn Phe Tyr Glu Ser 
520 525 530 



1699 



cat acg ata aac caa gcc tte aeg cag cet ttg gtg gta tte act get 
His Thr lie Asn Gin Ala Phe Thr Gin Pro Leu Val Val Phe Thr Ala 
535 540 545 



1747 



get act get get age gat att tat ate gat gcg ctt etc act tct cca 
Ala Thr Ala Ala Ser Asp lie Tyr lie Asp Ala Leu Leu Thr Ser Pro 
550 555 560 565 



1795 



gta caa act cca gaa cet cat tac ggg tat cag gga cat tgg gaa gee 
Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin Gly His Trp Glu Ala 
570 575 580 



1843 



act tgg gca gac aca tea act gca aaa tea gga act atg act tgg gta 
Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly Thr Met Thr Trp Val 
585 590 595 



1891 



act acg ggc tac aac cet aat cet gag cgt aga get tec gta gtt cec 
Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Ala Ser Val Val Pro 
600 605 610 



1939 



gat tea tta tgg gca tec ttt act gac att cgc act eta cag cag ate 
Asp Ser Leu Trp Ala Ser Phe Thr Asp lie Arg Thr Leu Gin Gin lie 
615 620 625 

atg aea tct caa gcg aat agt ate tat cag caa cga gga etc tgg gca 
Met Thr Ser Gin Ala Asn Ser lie Tyr Gin Gin Arg Gly Leu Trp Ala 
630 635 640 645 



1987 



2035 



( 
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tea gga act gcg aat ttc ttc cat aag gat aaa tea gga act aac caa 
Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys Ser Gly Thr Asn Gin 
650 655 660 



2083 



gca ttc cga cat aaa age tac ggc tat att gtt gga gga agt get gaa 
Ala Phe Arg His Lys Ser Tyr Gly Tyr lie Val Gly Gly Ser Ala Glu 
665 670 675 



2131 



gat ttt tct gaa aat ate ttc agt gta get ttc tgc cag etc ttc ggt 
Asp Phe Ser Glu Asn lie Phe Ser Val Ala Phe Cys Gin Leu Phe Gly 
680 685 690 



2179 



aaa gat aaa gac ctg ttt ata gtt gaa aat ace tct cat aac tat tta 
Lys Asp Lys Asp Leu Phe lie Val Glu Asn Thr Ser His Asn Tyr Leu 
695 700 705 



2227 



gcg teg eta tac ctg caa cat cga gca ttc eta gga gga ctt cec atg 
Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu Gly Gly Leu Pro Met 
710 715 720 725 



2275 



cec tea ttt gga agt ate ace gac atg ctg aaa gat att cet etc att 
Pro Ser Phe Gly Ser lie Thr Asp Met Leu Lys Asp lie Pro Leu lie 
730 735 740 



2323 



ttg aat gee cag eta age tac age tac act aaa aat gat atg gat act 
Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys Asn Asp Met Asp Thr 
745 750 755 



2371 



cge tat act tec tat cet gaa get caa ggc tct tgg ace aat aac tct 
Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser Trp Thr Asn Asn Ser 
760 765 770 



2419 



999 get eta gag etc gga gga tct ctg get eta tat etc cet aaa gaa 
Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu Tyr Leu Pro Lys Glu 
775 780 785 



2467 



gca ccg ttc ttc cag gga tat ttc cec ttc tta aag ttc cag gca gtc 
Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu Lys Phe Gin Ala Val 
790 795 800 805 



2515 



tac age cge caa caa aac ttt aaa gag agt ggc get gaa gee egt get 
Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly Ala Glu Ala Arg Ala 
810 815 820 



2563 



ttt gat gat gga gac eta gtg aac tgc tct ate cet gtc ggc att egg 
Phe Asp Asp Gly Asp Leu Val Asn Cys Ser lie Pro Val Gly lie Arg 
825 830 835 



2611 



tta gaa aaa ate tec gaa gat gaa aaa aat aat ttc gag att tct eta 
Leu Glu Lys lie Ser Glu Asp Glu Lys Asn Asn Phe Glu lie Ser Leu 
840 845 850 



2659 



gee tac att ggt gat gtg tat egt aaa aat cec egt teg cgt act tct 
Ala Tyr lie Gly Asp Val Tyr Arg Lys Asn Pro Arg Ser Arg Thr Ser 
855 860 865 



2707 
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eta atg gtc agt gga gcc tct tgg act teg eta tgt aaa aae cte gca 2755 

Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu Cys Lys Asn Leu Ala 
870 875 880 885 

cga caa gcc ttc tta gca agt get gga age cat etg act etc tec cct 2803 

Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His Leu Thr Leu Ser Pro 
890 895 900 

eat gta gaa etc tct ggg gaa get get tat gag ctt cgt ggc tea gca 2 851 

His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu Leu Arg Gly Ser Ala 
905 910 915 

cac ate tac aat gta gat tgt ggg eta aga tac tea ttc tagttectae 2900 

His lie Tyr Asn Val Asp Cys Gly Leu Arg Tyr Ser Phe 
920 925 930 

ttteetccct aaacttttag ggaggaatte ttataaaaae eetgtagatt cttaaettae 2960 



tagtctetee tttectcttg etttetttaa tttattgeag 



3000 
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Figure 8 A: Restriction enzyme analysis of CPNl 00639 



Ddel 
CviJI 
Bf al I 
Avrll| I I Hpyl78III 
I BsaJI I I I Bfal I 

I Styl I j I Xbal I I 

I I I I II I I III 
GTACGAAGTTCTTCACGAAATTATAATACAAACCTAGGCTCTAAGTTTTGTTTCTAGATT 
+ + + + + ^ 

CATGCTTCAAGAAGTGCTTTAATATTATGTTTGGATCCGAGATTCAAAACAAAGATCTAA 



Tsp509I 
MboII Hpyl78III I 

Rsal XmnI I 



60 



Tsp509I 
Msel 
Pad 
Vspl 
Tsp509I I 

Afllll I I I Tthlllll 

TaqI Maell Msel | || SfaNI | CviRI 

I I II II II I 

ATCGAAAACGTGTTAATTAATTGAACCCAAGCATCTTTCTATGAAAATACCCTTGCACAA 
61 + + + + + ^ -^20 

TAGCTTTTGCACAATTAATTAACTTGGGTTCGTAGAAAGATACTTTTATGGGAACGTGTT 



Plel 
Dpnl 
Sau3AI I 
Hpyl78IIl| I 
MboII I I I 
I II I 



Maelll 
Tsp4 5I 
PshAI 
Earl I 
Hinfl I 
TaqI I I 

I I I 



CviRI 
BsrDI I SfaNI 

I I I 

ACTCCTGATCTCTTCGACTCTTGTCACTCCCATTCTATTGAGCATTGCAACTTACGGAGC 

121 + + + + + + 180 

TGAGGACTAGAGAAGCTGAGAACAGTGAGGGTAAGATAACTCGTAACGTTGAATGCCTCG 



CviJI 
Fnu4HI 
Taul 
Acil 
CacSI 
Acil 
BsrBI 
Bed 
Faul 
Sthl32I I 
Alul I I 

Sfcl CviJI II I I I II I BsaXI 

I I II I I I II I I 

AGATGCTTCTTTATCCCCTACAGATAGCTTTGATGGAGCGGGCGGCTCTACATTTACTCC 

181 + + + + + + 240 

TCTACGAAGAAATAGGGGATGTCTATCGAAACTACCTCGCCCGCCGAGATGTAAATGAGG 
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Sfcl 

SfaNI I Hpyl78III SfaNI 

II II 
AAAATCTACAGCAGATGCCAATGGAACGAACTATGTCTTATCAGGAAATGTCTATATAAA 

241 + + + + + + 300 

TTTTAGATGTCGTCTACGGTTACCTTGCTTGATACAGAATAGTCCTTTACAGATATATTT 



301 



Fnu4HI 
Tsel 

Msel Fnu4HI I 
Bbvl I CviJI I I I 
Bbvl I I Tsel I II Sthl32I 

III III 
CGATGCTGGGAAAGGCACAGCATTAACAGGCTGCTGCTTTACAGAAACTACGGGTGATCT 

GCTACGACCCTTTCCGTGTCGTAATTGTCCGACGACGAAATGTCTTTGATGCCCACTAGA 



Hpyl8 8IX 
Dpnl 
Sau3AI 
BscGI 
Bsp24I 
Cjel 
CjePI 
I 



360 



361 



Faul NspV 

Taal TaqI 

Cjel Sthl32l| Bbvl I 

Bsri CjePI I Bsbl || Acelll | j 

HphI BciVI I Bsp24l| | SfaNI | | j Acil | | j CviRI 

III III I I II I III I 

GACATTTACTGGAAAGGGATACTCATTTTCATTCAACACGGTAGATGCGGGTTCGAATGC 

CTGTAAATGACCTTTCCCTATGAGTAAAAGTAAGTTGTGCCATCTACGCCCAAGCTTACG 



420 



CacSI 
Fnu4HI 
Taul 
Acil 
Fnu4HI 
Alul 



AlwNI 
BstAPI 
CviJI 
Mwol 
Tsel 
BsmI I 

I I 



CviJI 
MboII I 
Mwol I I 
Tthlllll I I I 
II II 

AGGAGCTGCGGCAAGCACAACTGCTGATAAAGCCCTAATCTTCACAGGATTTTCTAACCT 

421 + + + + + + 480 

TCCTCGACGCCGTTCGTGTTGACGACTATTTCGGGATTAGAAGTGTCCTAAAAGATTGGA 



Fig. 8 C 
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EC057I 
ScrFI 
AlwNI 
ECORII 
Alul 
CviJI 

Fnu4HI I I I lAcelll Rsal 
CviRI I I III Sfcl HpylVSIII Seal CviRI 

BsrDI Tsell | ||| Bbvl j Taal | TatI jMsel Sfcl | 

III I III II 

TTCCTTCATTGCAGCTCCTGGAACTACAGTTGCTTCAGGAAAAAGTACTTTAAGTTCTGC 

481 + + + + + + 

AAGGAAGTAACGTCGAGGACCTTGATGTCAACGAAGTCCTTTTTCATGAT^TTCAAGACG 



540 



Msel 
Cjel 
CviJI I 
Nlaiv| I 
PstI I I I 
I II I 



CjePI 
Hinfl I 
Tf il 



BsmAI 
BsmBI 



541 



I CviJI 
I Cjel I Maell 

I I II I I 

AGGAGCCTTAAATCTTACCGATAATGGAACGATTCTCTTTAGCCAAAACGTCTCCAATGA 

TCCTCGGAATTTAGAATGGCTATTACCTTGCTAAGAGAAATCGGTTTTGCAGAGGTTACT 



600 



Alul 
CviJI 
I 



601 



Dpnl 
Sau3AI 
CjePI 
Ecil I 

HphI I I I CjePI 
CjePI Acil| I I I MboIl| 

I II I I I II 

AGCTAATAACAATGGCGGAGCGATCACCACAAAAACTCTTTCTATTTCTGGGAATACCTC 

TCGATTATTGTTACCGCCTCGCTAGTGGTGTTTTTGAGAAAGATAAAGACCCTTATGGAG 



660 



Bbvl 

Sfcl Fnu4HI 

Mnll Bfal Tsp509I Dpnl | Alul Taul 

Earl I Spell Hhal | Sau3AI | | CviJI Acil | 

II II II I I I I II 

TTCTATAACCTTCACTAGTAATAGCGCAAAAAAATTAGGTGGAGCGATCTATAGCTCTGC 

661 + + + + + + 

AAGATATTGGAAGTGATCATTATCGCGTTTTTTTAATCCACCTCGCTAGATATCGAGACG 



720 



CviRI 
Fnu4HI I 
CviJI I I 
Tselj I 
II I 



Hpyl78III 



Bsri 
CviJI 
Haelll 
Eael 
Gdill 
Mspl I 
BsrFI I I 
I I I 



I 

GGCTGCAAGTATTTCAGGAAACACCGGCCAGTTAGTCTTTATGAATAATAAAGGAGAAAC 

721 + + + + + + 780 

CCGACGTTCATAAAGTCCTTTGTGGCCGGTCAATCAGAAATACTTATTATTTCCTCTTTG 



Fig. 8 D 
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BsrI 
Bmrl I 

I I 



Alul 
CviJI 
Cac8I 
CviJI 

Banll Cjel | 

Bspl286I BseRl| I 
CviJI I CviJI I I I 

II I II I 



Mnll 
Acelll I 
Taqi I I 

I I I 



Mboll 
Alul I 
CviJI I 



TGGGGGTGGGGCTCTGGGCTTTGAAGCCAGCTCCTCGATTACTCAAAATAGCTCCCTTTT 

781 + + + + + + 

ACCCCCACCCCGAGACCCGAAACTTCGGTCGAGGAGCTAATGAGTTTTATCGAGGGAAAA 



840 



Faul 
CacSI 
Pstl 
Sthl32I 
CviRl 
Fnu4HI 
Sfcl 
AlwNI 



Tsel 
BstAPI 
TspRI I 
CviRI 



BtsI I 
Bbvl I I 
Hpyl78III SfaNI j j 

I III 



Mwol 



CviJI 
Haelll 
NlalV 
Sau96I I 
Acil I I 
Cjel I I I 

I I II 



CTTCTCTGGAAACACTGCAACAGATGCTGCAGGCAAGGGCGGGGCCATTTATTGTGAAAA 

841 + + + + + + 

GAAGAGACCTTTGTGACGTTGTCTACGACGTCCGTTCCCGCCCCGGTAAATAACACTTTT 



900 



Hin4I 

Cjel I EC057I 

Plel Cjel II Cjel I Bce83I 

BsmAI I Hinf I | | Hpyl78III Hpyl88IX j | MboII | 

I I I II I I II I I 

AACAGGAGAGACTCCTACTCTTACTATCTCTGGAAATAAAAGTCTGACCTTCGCCGAGAA 

901 + + + + -f- + 960 

TTGTCCTCTCTGAGGATGAGAATGATAGAGACCTTTATTTTCAGACTGGAAGCGGCTCTT 
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Dpnl 
Bglll 
BstYI 
Sau3AI 
Hpyl78III 
Bf al 
Bbvl 
Xbal 
Nlalll 



BsaJI 
Bspl286I 
BstDSI 

NCOl 

Styl 
Bmgl 
BseSI 
BstAPI I 
Mwol I 



Earl Ecil 
Maelll Smll Acil | 

I I II I 

CTCTTCAGTAACTCAAGGCGGAGCAATCTGTGCCCATGGTCTAGATCTTTCCGCTGCTGG 

961 + + + + + + 

GAGAAGTCATTGAGTTCCGCCTCGTTAGACACGGGTACCAGATCTAGAAAGGCGACGACC 



CviJI 
Haelll 
Sau96I 
CacSI 
Fnu4HI 
MspAlI I 
Tsel I 
Acil I I 
I II 



1020 



Faul 
Sthl32I I 
SfaNI I I 

I II 



Acil 
Bbvl I 



CacSI 
PstI 
CviRI 
Fnu4HI 
Sf cl 
Alul 
CviJI 
MspAlI 
PvuII 
Tsel 
I 



Fnu4HI 
Taul 
Acil [Haell 
Mwol |HhaI I 



CCCTACCCTATTTTCAAATAATAGATGCGGGAACACAGCTGCAGGCAAGGGCGGCGCTAT 

1021 + + + + + + 1080 

GGGATGGGATAAAAGTTTATTATCTACGCCCTTGTGTCGACGTCCGTTCCCGCCGCGATA 



Plel 
Muni I 
Tsp509I I 
CviRI I I 
II I 



Dpnl 
BstYI 
Sau3AI 
Hpyl78III 
Haeiv| 
Hin4I I 
Hinf I I I 
I II 



BsmAI 

Alwl CviRI I BsaJI 

[Msel I BsmAI I |CjeI Maell Styl 

II I I I I II 

TGCAATTGCCGACTCTGGATCTTTAAGTCTCTCTGCAAATCAAGGAGACATCACGTTCCT 

1081 + + + + + + 

ACGTTAACGGCTGAGACCTAGAAATTCAGAGAGACGTTTAGTTCCTCTGTAGTGCAAGGA 



1140 
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Bsbl 



Cjel 



Mnll 
Hhal 
Thai I 
Mnll] I 
Acilll I 
III I 



Taqi 



Sau3AI 
ScrFI 
BsaJI I 
EcoRII I I 
Bsmi I I I 

I Ml 



TGGCAACACTCTAACCTCAACCTCCGCGCCAACATCGACACGGAATGCTATCTACCTGGG 

1141 + + + + + + 

ACCGTTGTGAGATTGGAGTTGGAGGCGCGGTTGTAGCTGTGCCTTACGATAGATGGACCC 



1200 



Bbvl 
CviJI 
Hael 
Haelll 
BsaJI 
Styl 
CviJI 
Fnu4HI 
Tsel 
CjePI 
Msel I 
Afllll I 
Smllj I 

I I I 

ATCGTCAGCAAAAATTACGAACTTAAGGGCAGCCCAAGGCCAATCTATCTATTTCTATGA 

1201 + + + + + + 

TAGCAGTCGTTTTTAATGCTTGAATTCCCGTCGGGTTCCGGTTAGATAGATAAAGATACT 



Tsp509I 
Dpnl Alwl I 

I I I 



Dpnl 
CjePI I 
Sau3AI I i 
Alwl I I I 

I III 



1260 



Hpyl8 8IX 
Mae 1 1 
Hpyl8 8IX I 
EC057I Alul I j 

Hpyl88IX CviRI | Sf aNI CviJI | | | Bed 

I III I I I I I 

TCCGATTGCATCTAACACCACAGGAGCTTCAGACGTTCTGACCATCAACCAACCGGATAG 

1261 + + + + + + 1320 

AGGCTAACGTAGATTGTGGTGTCCTCGAAGTCTGCAAGACTGGTAGTTGGTTGGCCTATC 



Mspl 
BsaWI I Hin4I 
II I 



Bbvl 
PstI 

Alul CviRI I 

Hpyl78III CviJI Sfcl | | 

I I II I 

CAACTCGCCTTTAGATTATTCAGGAACGATTGTATTTTCTGGGGAAAAGCTCTCTGCAGA 

1321 + + + + + + 

GTTGAGCGGAAATCTAATAAGTCCTTGCTAACATAAAAGACCCCTTTTCGAGAGACGTCT 



1380 



Fnu4HI 
Alul I 

CviJI I CviJI 
Mwol I Bfal I 

Mwol Tsel I Msel CviJI | j 

I II I III 

TGAAGCGAAAGCTGCTGATAACTTCACATCTATATTAAAGCAACCATTGGCTCTAGCCTC 

1381 + + + + + + 1440 

ACTTCGCTTTCGACGACTATTGAAGTGTAGATATAATTTCGTTGGTAACCGAGATCGGAG 



Fig. 8 G 
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BpulOI 
Ddel 
Mnll 
NlalV I 

Hpyl78III I I Taqi 
III I 

TGGAACCTTAGCACTCAAAGGAAATGTCGAGTTAGATGTCAATGGTTTCACACAGACTGA 

1441 + + + + + + 1500 

ACCTTGGAATCGTGAGTTTCCTTTACAGCTCAATCTACAGTTACCAAAGTGTGTCTGACT 



ScrFI 
Mnll 
EcoRII 
CviRI I , 

CviJI Nlalll I j I Alul Alul 

BseRI I Eco57l| ||| CviJI AlwNI CviJI 

II II III I II 

AGGCTCTACACTCCTCATGCAACCAGGAACAAAGCTCAAAGCAGATACTGAAGCTATCAG 

1501 + + + + + + 1560 

TCCGAGATGTGAGGAGTACGTTGGTCCTTGTTTCGAGTTTCGTCTATGACTTCGATAGTC 



Dpnl Ddel 
EC05 7I Sau3AI | Mnll | 

I II II 

TCTTACCAAACTTGTCGTTGATCTTTCTGCCTTAGAGGGAAATAAGAGTGTGTCCATTGA 

1561 + + + + + 1620 

AGAATGGTTTGAACAGCAACTAGAAAGACGGAATCTCCCTTTATTCTCACACAGGTAACT 

CviJI 

NlaIV| BseRI Mnll Mnll Acil 

II I III 

AACAGCAGGAGCCAACAAAACTATAACTCTAACCTCTCCTCTTGTTTTCCAAGATAGTAG 

1621 + + + + + + 1680 

TTGTCGTCCTCGGTTGTTTTGATATTGAGATTGGAGAGGAGAACAAAAGGTTCTATCATC 

Tsp509I CviJI 
Fnu4HI I Fnu4HI I Bbvl 

Taul I CviJI CviJI Tsel | | Bbvl | 

II I I II I I I 

CGGCAATTTTTATGAAAGCCATACGATAAACCAAGCCTTCACGCAGCCTTTGGTGGTATT 

1681 + + + + + + 1740 

GCCGTTAAAAATACTTTCGGTATGCTATTTGGTTCGGAAGTGCGTCGGAAACCACCATAA 



TspRI 
Fnu4HI I 
Tsel I I 
Btsi I I I 
Bbvl I 



CacBI 
Bf al 
Nhel 
Fnu4HI 
Mwol I 
Tsel I 
II 
II 



Bpml 
Clal I 

I I I Mwol I I III SfaNI TaqI |HhaI 

I I III I II III I III 

CACTGCTGCTACTGCTGCTAGCGATATTTATATCGATGCGCTTCTCACTTCTCCAGTACA 

1741 + + + + + + 1800 

GTGACGACGATGACGACGATCGCTATAAATATAGCTACGCGAAGAGTGAAGAGGTCATGT 



Rsal 
TatI I 
Bpml BsrI | | 

I III 



CjePI Bce83I 
SfaNI I Sfcl I 
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Mnll 

Sthl32I BscGI I BsmFI Bgll 

Hpyl78III I BslI || CjePI Cvi JI | Mwol 

I I I II I II I 

AACTCCAGAACCTCATTACGGGTATCAGGGACATTGGGAAGCCACTTGGGCAGACACATC 

1801 + + + + + + I860 

TTGAGGTCTTGGAGTAATGCCCATAGTCCCTGTAACCCTTCGGTGAACCCGTCTGTGTAG 

BpulOI 

Hpyl78III Ddel 
CviRI I Maelll CviJI Hpyl78III 

CjePI I I Sthl32l| BscGI |BseMII | 

III II III I 

AACTGCAAAATCAGGAACTATGACTTGGGTAACTACGGGCTACAACCCTAATCCTGAGCG 

1861 + + + + + + 1920 

TTGACGTTTTAGTCCTTGATACTGAACCCATTGATGCCCGATGTTGGGATTAGGACTCGC 

Sthl32I 
Hinf I 
Tf il 
Fokl I 

Alul Hpyl78III I I 

CviJI CjePI I I I 

I I III 

TAGAGCTTCCGTAGTTCCCGATTCATTATGGGCATCCTTTACTGACATTCGCACTCTACA 

1921 + + + + + + 1980 

ATCTCGAAGGCATCAAGGGCTAAGTAATACCCGTAGGAAATGACTGTAAGCGTGAGATGT 

Nlalll 
Hpyl78III 
Real I 

Dpnl I I I Plel 
Sau3AI I I I I Smll Mnll | Hinf I Hpyl78III 

Ml I I I III I 

GCAGATCATGACATCTCAAGCGAATAGTATCTATCAGCAACGAGGACTCTGGGCATCAGG 

1981 + + + + + + 2040 

CGTCTAGTACTGTAGAGTTCGCTTATCATAGATAGTCGTTGCTCCTGAGACCCGTAGTCC 

Apol Alul 

Tsp509I CviJI 

MboII I Tthlllll I 

AlwNI I I Hpyl88IX | | 

SfaNI I jxmnl Hpyl78III BsmI | Mmel | j 

I I I I I I I I I I 

AACTGCGAATTTCTTCCATAAGGATAAATCAGGAACTAACCAAGCATTCCGACATAAAAG 

2041 + + + + + + 2100 

TTGACGCTTAAAGAAGGTATTCCTATTTAGTCCTTGATTGGTTCGTAAGGCTGTATTTTC 

MboII 

MboII I Alul 

Mmel Hpyl88IX| | CviJI 

CviJI Mnll I Beef I Eco57I {|| Eco57I. TspRI | 

I I I I I III I II 

CTACGGCTATATTGTTGGAGGAAGTGCTGAAGATTTTTCTGAAAATATCTTCAGTGTAGC 

2101 + + + + + + 2160 

GATGCCGATATAACAACCTCCTTCACGACTTCTAAAAAGACTTTTATAGAAGTCACATCG 
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Alul 

CviJI Acelll 

Cac8I I Earl | 

MboII I I Sapl I Hgal 

I I I I I I 

TTTCTGCCAGCTCTTCGGTAAAGATAAAGACCTGTTTATAGTTGAAAATACCTCTCATAA 

2161 + + + + + + 2220 

AAAGACGGTCGAGAAGCCATTTCTATTTCTGGACAAATATCAACTTTTATGGAGAGTATT 



Bf al 
Avrll I 
BsaJI I 
Styl I 

CviRI TaqI Mnll 
Mnll Mwol |BspMl|BsmI | || Nlalll 

I I I II I I II I 

CTATTTAGCGTCGCTATACCTGCAACATCGAGCATTCCTAGGAGGACTTCCCATGCCCTC 

2221 + + + + + + 2280 

GATAAATCGCAGCGATATGGACGTTGTAGCTCGTAAGGATCCTCCTGAAGGGTACGGGAG 



Mnll 
HphI I 
BslI I I 
I I I 



Nlalll 
MslI Nspl 

I I 



Mnll 



Alul 
CviJI 
Bpull02I 
Ddel 
Alul I 
CviJI i 
Bsmi I I 

I II 



ATTTGGAAGTATCACCGACATGCTGAAAGATATTCCTCTCATTTTGAATGCCCAGCTAAG 

2281 + + + + + + 2340 

TAAACCTTCATAGTGGCTGTACGACTTTCTATAAGGAGAGTAAAACTTACGGGTCGATTC 



Alul Smll 
CviJI Hpyl78III Alul I 

Sfcl I BciVI Bce83I | CviJI | CviJI 

II I I I II I 

CTACAGCTACACTAAAAATGATATGGATACTCGCTATACTTCCTATCCTGAAGCTCAAGG 

2341 + + + + + + 2400 

GATGTCGATGTGATTTTTACTATACCTATGAGCGATATGAAGGATAGGACTTCGAGTTCC 
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1 



J 



Dpnl 



BstYI 
Sau3AI 
Hpyl8 8IX 

Banll 
BsiHKAI 
Bspl286I 
Sad 
Alul 
CviJI 
Mnll 



EC057I 
Avail I 
Sau96I I 
I I 



CjePI 
Hpyl78III 
Bf al 
Banll 
Bspl286I 
Xbal 
CviJI I 



CviJI 
Alwl I 
II 



CjePI 
I 



2401 



CTCTTGGACCAATAACTCTGGGGCTCTAGAGCTCGGAGGATCTCTGGCTCTATATCTCCC 

+ + + + + + 2460 

GAGAACCTGGTTATTGAGACCCCGAGATCTCGAGCCTCCTAGAGACCGAGATATAGAGGG 



ScrFI 
BsaJI I 

EcoRIlll ScrFI Sfcl 

MboII Taal III Msel EcoRII | Accl| CviJI 

I I III I I I II I 

TAAAGAAGCACCGTTCTTCCAGGGATATTTCCCCTTCTTAAAGTTCCAGGCAGTCTACAG 

2461 + + + + + + 2520 

ATTTCTTCGTGGCAAGAAGGTCCCTATAAAGGGGAAGAATTTCAAGGTCCGTCAGATGTC 



Bed 

Acil Sthl32I Eco57l| 

Fnu4HI Dral Haell BscGI j Bsal | | Bfal 

Taul Msel I Hhal|CviJI | |BsmAI || BsaXI | 

I II II I I I I II II 

CCGCCAACAAAACTTTAAAGAGAGTGGCGCTGAAGCCCGTGCTTTTGATGATGGAGACCT 

2521 + + + + + + 

GGCGGTTGTTTTGAAATTTCTCTCACCGCGACTTCGGGCACGAAAACTACTACCTCTGGA 



2580 



BsmI HpylSSIX MboII 

I I I 

AGTGAACTGCTCTATCCCTGTCGGCATTCGGTTAGAAAAAATCTCCGAAGATGAAAAAAA 

2581 + + + + + + 2640 

TCACTTGACGAGATAGGGACAGCCGTAAGCCAATCTTTTTTAGAGGCTTCTACTTTTTTT 



Hpyl78III CviJI Sthl32I 

Tsp509I TaqI | Bfal | BsaBI HphI BscGI Thai 

III II II II 

TAATTTCGAGATTTCTCTAGCCTACATTGGTGATGTGTATCGTAAAAATCCCCGTTCGCG 

2641 + + + + + + 2700 

ATTAAAGCTCTAAAGAGATCGGATGTAACCACTACACATAGCATTTTTAGGGGCAAGCGC 
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CviJI 

Nlaivl Mnll 
Rsal TspRI I Mnll Mwol 

I II I I 

TACTTCTCTAATGGTCAGTGGAGCCTCTTGGACTTCGCTATGTAAAAACCTCGCACGACA 

2701 + + + + + + 2760 

ATGAAGAGATTACCAGTCACCTCGGAGAACCTGAAGCGATACATTTTTGGAGCGTGCTGT 



Hinf I 
Hpyl88IX 
Bed I 

Mwol Plel I I 

CviJI Ddel I CviJI | j | 

I II III I 



Bbvl 
Mnll I 
Nlalll I I 
I I I 



AGCCTTCTTAGCAAGTGCTGGAAGCCATCTGACTCTCTCCCCTCATGTAGAACTCTCTGG 

2761 + + + + + + 2820 

TCGGAAGAATCGTTCACGACCTTCGGTAGACTGAGAGAGGGGAGTACATCTTGAGAGACC 

Fnu4HI 

Alul| Bpull02I 

CviJI I Alul Ddel RleAI Ddel 

Tselj CviJI CviJI I BseMIll Cvi JI | 

II I II II II 

GGAAGCTGCTTATGAGCTTCGTGGCTCAGCACACATCTACAATGTAGATTGTGGGCTAAG 

2821 + + + + + + 2880 

CCTTCGACGAATACTCGAAGCACCGAGTCGTGTGTAGATGTTACATCTAACACCCGATTC 

Apol 

Mnll EcoRI 
Bfal Mnll I Tsp509I 

I III 

ATACTCATTCTAGTTCCTACTTTCCTCCCTAAACTTTTAGGGAGGAATTCTTATAAAAAC 

2881 + + + + + + 2940 

TATGAGTAAGATCAAGGATGAAAGGAGGGATTTGAAAATCCCTCCTTAAGAATATTTTTG 

Tsp509I 

Hinf I Bfal Msel| 

Sfcl Tfil Msel Spel| BstnAI Mnll || CviRI 

I I I II I I II I 

CCTGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGCAG 

2941 + + + + + + 3000 

GGACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTC 
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Figure 9 A: Nucleotide and amino acid sequence of CPNl 00708 



ataaagtttt ttatatgaac aaaactttga cttcattggc agacacttcg accttaacag 60 

accaattttg ttgtcatccc tataaaaatc aggaattttc atg etc tec tea eta 115 

Met Leu Ser Ser Leu 
1 5 

ate egt gat tea ttt ecc ctt ctt att tta ctt ccc aca ttc eta gcg 163 

lie Arg Asp Ser Phe Pro Leu Leu lie Leu Leu Pro Thr Phe Leu Ala 
10 15 20 

gca tta gga gee tee gta get ggc ggc gtt atg gga ace tat ate gtt 211 

Ala Leu Gly Ala Ser Val Ala Gly Gly Val Met Gly Thr Tyr lie Val 
25 30 35 

gta aaa egt att gtt tea att agt gga agt ata tct cat gca att eta 259 

Val Lys Arg lie Val Ser lie Ser Gly Ser lie Ser His Ala lie Leu 
40 45 50 

gga gga att ggc etc ace eta tgg ata caa tat aag ctt eat etc tet 3 07 

Gly Gly lie Gly Leu Thr Leu Trp lie Gin Tyr Lys Leu His Leu Ser 
55 60 65 

ttt ttc cct atg tat gga get att gta gga get att ttt eta get ctt 355 

Phe Phe Pro Met Tyr Gly Ala lie Val Gly Ala lie Phe Leu Ala Leu 
70 75 80 85 

tge ate ggc aag ate cac ctg aaa tac caa gaa agg gaa gac tet ttg 4 03 

Cys lie Gly Lys lie His Leu Lys Tyr Gin Glu Arg Glu Asp Ser Leu 

90 95 100 

att gcg atg att tgg tct gtg ggc atg gca att gga att ata ttc att 451 

lie Ala Met lie Trp Ser Val Gly Met Ala lie Gly lie lie Phe lie 
105 110 115 

tee agg ctt ecc ace ttt aat gga gag etc ate aat ttt eta ttt ggg 499 

Ser Arg Leu Pro Thr Phe Asn Gly Glu Leu lie Asn Phe Leu Phe Gly 
120 125 130 

aac att etc tgg gtc acc cct tea gac etc tat age tta gga ate ttt 547 

Asn lie Leu Trp Val Thr Pro Ser Asp Leu Tyr Ser Leu Gly lie Phe 
135 140 145 

gat ctt ctt gtt tta gga att gtg gtc ctt tge cac acc egg ttc ctt 595 

Asp Leu Leu Val Leu Gly lie Val Val Leu Cys His Thr Arg Phe Leu 
150 155 160 165 

get ctt tge ttt gat gag agg tac aeg get tta aac cat tgt tct gta 643 

Ala Leu Cys Phe Asp Glu Arg Tyr Thr Ala Leu Asn His Cys Ser Val 

170 175 180 



eag ctg tgg tat ttc eta ctt ctt gtt ctg aca gca ate acg att gtg 
Gin Leu Trp Tyr Phe Leu Leu Leu Val Leu Thr Ala lie Thr lie Val 
185 190 195 
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atg ttg att tat gtg atg gga acg att ctg atg ctt age atg etc gtc 739 
Met Leu lie Tyr Val Met Gly Thr lie Leu Met Leu Ser Met Leu Val 
200 205 210 

tta act gtt get ata gcg tgt aga ttt teg tac aag atg aca cga att 787 
Leu Pro Val Ala lie Ala Cys Arg Phe Ser Tyr Lys Met Thr Arg lie 
215 220 225 

atg ttc ate teg gtc etc ttg aat ate tta tgt tet ttt tet gga att 835 
Met Phe lie Ser Val Leu Leu Asn lie Leu Cys Ser Phe Ser Gly lie 
230 235 240 245 

tge ate gee tac tgt tta gat ttc cea gta ggt eet aeg ata tea ttg 883 
Cys lie Ala Tyr Cys Leu Asp Phe Pro Val Gly Pro Thr lie Ser Leu 
250 255 260 

etg atg ggg tta ggt tat aca gcg agt ett tgt gtg aag aag egg tac 931 
Leu Met Gly Leu Gly Tyr Thr Ala Ser Leu Cys Val Lys Lys Arg Tyr 
265 270 275 

aat eeg teg acg eet tet eet gta agt ect gaa ate aat aca aat gta 979 
Asn Pro Ser Thr Pro Ser Pro Val Ser Pro Glu lie Asn Thr Asn Val 
280 285 290 



tagctaggga agegcttttg gaagctttgg 
tcatcaattt tatttaaagc taceagcata 
t 



aggcattett cctgttegte aggaagaaga 103 9 
tetttctttt caaaatctgg etgatgagag 1099 

1100 
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Figure 10 A: Restriction enzyme analysis of CPNl 00708 



Bcgl 



TaqI Msel 
I I 



ATAAAGTTTTTTATATGAACAAAACTTTGACTTCATTGGCAGACACTTCGACCTTAACAG 

1 + + + + + + 60 

TATTTCAAAAAATATACTTGTTTTGAAACTGAAGTAACCGTCTGTGAAGCTGGAATTGTC 

BseRI 

Tsp509I Apol I 

Bcgl I Tsp509I I 

Fokl|i Hpyl78III I |NlaIII Mnll 

Ml I I I I I 
ACCAATTTTGTTGTCATCCCTATAAAAATCAGGAATTTTCATGCTCTCCTCACTAATCCG 
61 + + + + + + 120 

TGGTTAAAACAACAGTAGGGATATTTTTAGTCCTTAAAAGTACGAGAGGAGTGATTAGGC 



RleAI 
Fnu4HI 
Taul 

Hinfl Acil| 
Tf il Bfal I I 

I III 
TGATTCATTTCCCCTTCTTATTTTACTTCCCACATTCCTAGCGGCATTAGGAGCCTCCGT 

ACTAAGTAAAGGGGAAGAATAAAATGAAGGGTGTAAGGATCGCCGTAATCCTCGGAGGCA 



BsaXI 
CviJI I 
NlaIV| I 
Mwol I I |MwoI 
I III I 



180 



Fnu4HI 
Taul 
Acil 
Cac8I 
Mnll 
Alul I 

CviJI I I I NlalV Maell Tsp509I 

MM I I I 

AGCTGGCGGCGTTATGGGAACCTATATCGTTGTAAAACGTATTGTTTCAATTAGTGGAAG 

181 + + + + + + 240 

TCGACCGCCGCAATACCCTTGGATATAGCAACATTTTGCATAACAAAGTTAATCACCTTC 



Bfal 
Mnll 
Tsp509I I 
CviRI I I 
Nlalllj I 
II I 



241 



CviJI 
Hael 

Haelll Alul 
HphI I CviJI 

Tsp509l| I BciVI Mnll Hindlll | 

Mill II 

TATATCTCATGCAATTCTAGGAGGAATTGGCCTCACCCTATGGATACAATATAAGCTTCA 
ATATAGAGTACGTTAAGATCCTCCTTAACCGGAGTGGGATACCTATGTTATATTCGAAGT 



300 



BsaXI Alul 
Alul I Alul CviJI 

BslI CviJI j CviJI Bfal | CviRI 

III I II I 

TCTCTCTTTTTTCCCTATGTATGGAGCTATTGTAGGAGCTATTTTTCTAGCTCTTTGCAT 

301 + + + + + + 360 

AGAGAGAAAAAAGGGATACATACCTCGATAACATCCTCGATAAAAAGATCGAGAAACGTA 
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Dpnl 
BstYI 
Sau3AI 

SfaNI I Plel Bbsl 

Alwl I I Bsll| Hinfl 

Mill I 
CGGCAAGATCCACCTGAAATACCAAGAAAGGGAAGACTCTTTGATTGCGATGATTTGGTC 

361 + + + + + + 

GCCGTTCTAGGTGGACTTTATGGTTCTTTCCCTTCTGAGAAACTAACGCTACTAAACCAG 



iMboII RleAI 



420 



Tsp509I 
Muni I 
Tsp509I I 
Nlalll I I 
I I 



CviJI 
Cjel I 
ScrFI I 
EcoRII I I 
I I I 



Msel 
I 



Banll 
BsiHKAI 
Bspl286I 
Sad 
Alul I 
CviJI I 
Hin4I 



TGTGGGCATGGCAATTGGAATTATATTCATTTCCAGGCTTCCCACCTTTAATGGAGAGCT 

421 + + + + + + 

ACACCCGTACCGTTAACCTTAATATAAGTAAAGGTCCGAAGGGTGGAAATTACCTCTCGA 



480 



Hinfl 
Mnll 
BpulOI 

BstEII Ddel 
cjel Maelll Alul | 

Cjel I HphI SimI Cjel Cvi JI j 

Tsp509I I I EC057I |Tsp45I HpylBSIX | Sf cl |||TfiI 

III II I I I I III I 

CATCAATTTTCTATTTGGGAACATTCTCTGGGTCACCCCTTCAGACCTCTATAGCTTAGG 

481 + + + + + + 

GTAGTTAAAAGATAAACCCTTGTAAGAGACCCAGTGGGGAAGTCTGGAGATATCGAATCC 



540 



Dpnl 
Sau3AI I 
MboII I I 

I I I 



Avail 
Cjel 

Tsp509I Sau96I 
I I 



Sthl32I 
NlalV 
Mspl I 
Neil I 
ScrFI I 

I I 



AATCTTTGATCTTCTTGTTTTAGGAATTGTGGTCCTTTGCCACACCCGGTTCCTTGCTCT 

541 + + + + + + 600 

TTAGAAACTAGAAGAACAAAATCCTTAACACCAGGAAACGGTGTGGGCCAAGGAACGAGA 



Mnll 
Cjell 



Dral 
Msel I 
Rsal CviJI I I 
I III 



Alul 
CviJI 
MspAlI 
PvuII 
Rsal 
BsrGI I 
TatI I 
Bcefl I I 
I I I 



TTGCTTTGATGAGAGGTACACGGCTTTAAACCATTGTTCTGTACAGCTGTGGTATTTCCT 

601 + + + + + + 660 

AACGAAACTACTCTCCATGTGCCGAAATTTGGTAACAAGACATGTCGACACCATAAAGGA 
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HpylSSIX 



MslI 

Hpyl78III I 
I I 
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Hpyl88IX 
Hinfl I 
SfaNI I I 
Bcgl Bed |TfiI I 
I I I I I 



ACTTCTTGTTCTGACAGCAATCACGATTGTGATGTTGATTTATGTGATGGGAA.CGATTCT 

661 + + + + + + 720 

TGAAGAACAAGACTGTCGTTAGTGCTAACACTACAACTAAATACACTACCCTTGCTAAGA 



Nlalll 
Nspl 
SphI 

Bpull02I CacSI I 

Ddel Mwol I I Bcgl Sfcl Rsal 

I II I I I I 

GATGCTTAGCATGCTCGTCTTACCTGTTGCTATAGCGTGTAGATTTTCGTACAAGATGAC 

721 + + + + + + 

CTACGAATCGTACGAGCAGAATGGACAACGATATCGCACATCTAAAAGCATGTTCTACTG 



780 



Hpyl78III Apol 
Taqll Avail | Tsp509I 

Tsp509I I Sau96I | Mnll Hpyl78III | CviRI 

II III III 

ACGAATTATGTTCATCTCGGTCCTCTTGAATATCTTATGTTCTTTTTCTGGAATTTGCAT 

781 + + + + + + 840 

TGCTTAATACAAGTAGAGCCAGGAGAACTTATAGAATACAAGAAAAAGACCTTAAACGTA 



841 



BsrDI 
ECORV 

pfiiioei 

Avail 
ECOO109I 
PspBII 
Sau96I 
Sse8647I 
Bcgl I 
Taal Bmrl | j 

SfaNI I Bsrl|| I III Bed Bcgl 

II III I I I I I I 
CGCCTACTGTTTAGATTTCCCAGTAGGTCCTACGATATCATTGCTGATGGGGTTAGGTTA 
+ + + + + + 

GCGGATGACAAATCTAAAGGGTCATCCAGGATGCTATAGTAACGACTACCCCAATCCAAT 



900 



901 



BsaXI 
Hin4I 
BsaHI 
Hindi 
MboII AccI I 
Rsal I TaqI j 

Hinfl Plel Acil | | Sail | | | MHgal Hpyl78III 

I I I I I III I II I I 

TACAGCGAGTCTTTGTGTGAAGAAGCGGTACAATCCGTCGACGCCTTCTCCTGTAAGTCC 

+ + h + + + 

ATGTCGCTCAGAAACACACTTCTTCGCCATGTTAGGCAGCTGCGGAAGAGGACATTCAGG 



960 
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961 



Haell Alul 
Bf al Hhal I CviJI BsmI 

Alul I Eco47IIl|l Mnll| Mwol | 

CviJlj Mwol I I iHindlll 1 |MboII I j 

II I III III II I 
TGAAATCAATACAAATGTATAGCTAGGGAAGCGCTTTTGGAAGCTTTGGAGGCATTCTTC 
+ + + + + + 

ACTTTAGTTATGTTTACATATCGATCCCTTCGCGAAAACCTTCGAAACCTCCGTAAGAAG 



1020 



MboII 
MboII 
Tsp509I 
Dpnl I 
Hpyl78III Sau3AI | | 
I III 



Alul 
CviJI 
Dral I 
Msel| I 
I I I 

CTGTTCGTCAGGAAGAAGATCATCAATTTTATTTAAAGCTACCAGCATATCTTTCTTTTC 

1021 + + - + + + + 1080 

GACAAGCAGTCCTTCTTCTAGTAGTTAAAATAAATTTCGATGGTCGTATAGAAAGAAAAG 



CviJI 



AAAATCTGGCTGATGAGAGT 

1081 + + 1100 

TTTTAGACCGACTACTCTCA 
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Figure lA: Nucleotide and deduced amino acid sequence of CPNl 00634 
gataaaaatt cttgacagct gttttgtcat ctttaacttg atttacttat tttgtttcta 60 



tattgatgcg aatagttctc taaaaaacaa aagcattacc atg aag act teg att 

Met Lys Thr Ser lie 
1 5 

cct tgg gtt tta gtt tec tec gtg tta get ttc tea tgt cac eta eag 
Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gin 
10 15 20 



115 



163 



tea eta get aac gag gaa ctt tta tea eet gat gat age ttt aat gga 211 
Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly 
25 30 35 

aat ate gat tea gga aeg ttt act cca aaa act tea gee aca aea tat 259 
Asn lie Asp Ser Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr 
40 45 50 

tct eta aea gga gat gte ttc ttt tac gag eet gga aaa gge act eee 307 
Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro 
55 60 65 

tta tct gac agt tgt ttt aag eaa ace aeg gae aat ett aec tte ttg 355 
Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu 
70 75 80 85 

ggg aac ggt cat age tta aeg ttt gge ttt ata gat get ggc act eat 4 03 
Gly Asn Gly His Ser Leu Thr Phe Gly Phe lie Asp Ala Gly Thr His 
90 95 100 

gea ggt get get gea tct aca aca gea aat aag aat ett ace tte tea 451 
Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser 
105 110 115 

ggg ttt tec tta ctg agt ttt gat tec tct cct age aea aeg gtt act 499 
Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr 
120 125 130 

aea ggt eag gga aeg ctt tec tea gea gga gge gta aat tta gaa aat 547 
Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn 
135 140 145 

att cgt aaa ctt gta gtt get ggg aat ttt tct act gea gat ggt gga 595 
lie Arg Lys Leu Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly 
150 155 160 165 

get ate aaa gga geg tct ttc ett tta act ggc act tct gga gat get 643 
Ala lie Lys Gly Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala 
170 175 180 



ett ttt agt aac aac tct tea tea aca aag gga gga gea att get act 
Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly Gly Ala lie Ala Thr 
185 190 195 



691 



\ ■ c 



R 
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2/77 



^AOtViFfg. 1 0 (CO n 't) - 



ANNOTATED SHEET SHOWING CHANGES 

Title: CHLAMYDIA ANTIGENS AND 
CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/857,128 



aca gca ggc get cgc ata gca aat aac aca ggt tat gtt aga ttc eta 739 
Thr Ala Gly Ala Arg lie Ala Asn Asn Thr Gly Tyr Val Arg Phe Leu 
200 205 210 



tct aac ata gcg tct acg tea gga gge get ate gat gat gaa ggc acg 787 

Ser Asn lie Ala Ser Thr Ser Gly Gly Ala lie Asp Asp Glu Gly Thr 
215 220 225 

teg ata eta teg aac aac aaa ttt eta tat ttt gaa ggg aat gca gcg 835 

Ser lie Leu Ser Asn Asn Lys Phe Leu Tyr Phe Glu Gly Asn Ala Ala 

230 235 240 245 



aaa act act ggc ggt gcg ate tge aac ace aag gcg agt gga tct cct 8 83 
Lys Thr Thr Gly Gly Ala lie Cys Asn Thr Lys Ala Ser Gly Ser Pro 
250 255 260 



gaa ctg ata ate tct aac aat aag act ctg ate ttt get tea aac gta 931 
Glu Leu lie lie Ser Asn Asn Lys Thr Leu lie Phe Ala Ser Asn Val 
265 270 275 



gca gaa aca age ggt ggc gee ate cat get aaa aag eta gee ett tec 979 
Ala Glu Thr Ser Gly Gly Ala lie His Ala Lys Lys Leu Ala Leu Ser 
280 285 290 



tct gga ggc ttt aca gag ttt eta cga aat aat gtc tea tea gca act 1027 
Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn Val Ser Ser Ala Thr 
295 300 305 



cct aag ggg ggt get ate age ate gat gee tea gga gag etc agt ett 1075 
Pro Lys Gly Gly Ala lie Ser lie Asp Ala Ser Gly Glu Leu Ser Leu 
310 315 320 325 



tct gca gag aca gga aac att ace ttt gta aga aat ace ett aca aca 1123 
Ser Ala Glu Thr Gly Asn lie Thr Phe Val Arg Asn Thr Leu Thr Thr 
330 335 340 



acc gga agt ace gat act cct aaa cgt aat gcg ate aac ata gga agt 1171 
Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala lie Asn lie Gly Ser 
345 350 355 



aac ggg aaa ttc acg gaa tta egg get get aaa aat eat aca att ttc 1219 
Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys Asn His Thr lie Phe 
360 365 370 



ttc tat gat cec ate act tea gaa gga ace tea tea gae gta ttg aag 1267 
Phe Tyr Asp Pro lie Thr Ser Glu Gly Thr Ser Ser Asp Val Leu Lys 
375 380 385 



ata aat aac ggc tct gcg gga get etc aat eca tat caa gga acg att 1315 
lie Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro Tyr Gin Gly Thr lie 
390 395 400 405 



eta ttt tct gga gaa acc eta aca gca gat gaa ett aaa gtt get gae 
Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu Leu Lys Val Ala Asp 
410 415 420 



1363 
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aat tta aaa tct tea ttc acg cag cca gtc tec eta tee gga gga aag 
Asn Leu Lys Ser Ser Phe Thr Gin Pro Val Ser Leu Ser Gly Gly Lys 
425 430 435 



1411 



tta ttg eta caa aag gga gtc act tta gag age acg age ttc tct caa 
Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser Thr Ser Phe Ser Gin 
440 445 450 



1459 



gag gee ggt tct etc etc ggc atg gat tea gga acg aca tta tea act 
Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly Thr Thr Leu Ser Thr 
455 460 465 



1507 



aca get ggg agt att aca ate acg aac eta gga ate aat gtt gae tee 
Thr Ala Gly Ser lie Thr lie Thr Asn Leu Gly lie Asn Val Asp Ser 
470 475 480 485 



1555 



tta ggt ctt aag cag cec gtc age eta aca gea aaa ggt get tea aat 
Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala Lys Gly Ala Ser Asn 
490 495 500 



1603 



aaa gtg ate gta tct ggg aag etc aac etg att gat att gaa ggg aac 
Lys Val lie Val Ser Gly Lys Leu Asn Leu lie Asp lie Glu Gly Asn 
505 510 515 



1651 



att tat gaa agt eat atg ttc age eat gae cag etc ttc tct eta tta 
lie Tyr Glu Ser His Met Phe Ser His Asp Gin Leu Phe Ser Leu Leu 
520 525 530 



1699 



aaa ate acg gtt gat get gat gtt gat act aac gtt gae ate age age 
Lys lie Thr Val Asp Ala Asp Val Asp Thr Asn Val Asp lie Ser Ser 
535 540 545 



1747 



Ctt ate ect gtt cet get gag gat ect aat tea gaa tae gga ttc caa 
Leu lie Pro Val Pro Ala Glu Asp Pro Asn Ser Glu Tyr Gly Phe Gin 
550 555 560 565 



1795 



gga caa tgg aat gtt aat tgg act acg gat aca get aca aat aca aaa 
Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr Ala Thr Asn Thr Lys 
570 575 580 



1843 



gag gee acg gea act tgg ace aaa aca gga ttt gtt cec age cec gaa 
Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe Val Pro Ser Pro Glu 
585 590 595 



1891 



aga aaa tct geg tta gta tge aat ace eta tgg gga gtc ttt act gae 
Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp Gly Val Phe Thr Asp 
600 605 610 



1939 



att egc tct etg caa cag ctt gta gag ate ggc gea act ggt atg gaa 
lie Arg Ser Leu Gin Gin Leu Val Glu lie Gly Ala Thr Gly Met Glu 
615 620 625 



1987 



cae aaa caa ggt ttc tgg gtt tec tec atg acg aac ttc etg cat aag 
His Lys Gin Gly Phe Trp Val Ser Ser Met Thr Asn Phe Leu His Lys 
630 635 640 645 



2035 
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act gga gat gaa aat cgc aaa ggc ttc cgt cat acc tct gga ggc tac 2083 
Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His Thr Ser Gly Gly Tyr 
650 655 660 

gtc ate ggt gga agt get cac act cct aaa gac gac eta ttt acc ttt 2131 
Val lie Gly Gly Ser Ala His Thr Pro Lys Asp Asp Leu Phe Thr Phe 
665 670 675 

geg ttc tgc cat etc ttt get aga gac aaa gat tgt ttt ate get cac 2179 
Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp Cys Phe lie Ala His 
680 685 690 

aac aae tct aga ace tac ggt gga act tta ttc ttc aag cac tct cat 2227 
Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe Phe Lys His Ser His 
695 700 705 

acc eta eaa ccc caa aac tat ttg aga tta gga aga gca aag ttt tct 2275 
Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly Arg Ala Lys Phe Ser 
710 715 720 725 

gaa tea get ata gaa aaa ttc cct agg gaa att ccc eta gee ttg gat 2323 
Glu Ser Ala lie Glu Lys Phe Pro Arg Glu lie Pro Leu Ala Leu Asp 
730 735 740 

gtc caa gtt teg ttc age cat tea gac aac cgt atg gaa acg cac tat 2371 
Val Gin Val Ser Phe Ser His Ser Asp Asn Arg Met Glu Thr His Tyr 
745 750 755 

acc tea ttg eca gaa tec gaa ggt tct tgg age aac gag tgt ata get 2419 
Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser Asn Glu Cys lie Ala 
760 765 770 

ggt ggt ate ggc eta gac ctt cct ttt gtt ett tec aac cca cat cct 2467 
Gly Gly lie Gly Leu Asp Leu Pro Phe Val Leu Ser Asn Pro His Pro 
775 780 785 

ctt ttc aag ace ttc att cea cag atg aaa gtc gaa atg gtt tat gta 2515 
Leu Phe Lys Thr Phe lie Pro Gin Met Lys Val Glu Met Val Tyr Val 
790 795 800 805 

tea eaa aat age ttc ttc gaa age tct agt gat ggc cgt ggt ttt agt 2563 
Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp Gly Arg Gly Phe Ser 
810 815 820 

att gga agg ctg ett aac etc teg att cct gtg ggt geg aaa ttc gtg 2611 
lie Gly Arg Leu Leu Asn Leu Ser lie Pro Val Gly Ala Lys Phe Val 
825 830 835 

C3^9 ggg ^^.t ate gga gat tec tac ace tat gat etc tea gga ttc ttt 2659 
Gin Gly Asp lie Gly Asp Ser Tyr Thr Tyr Asp Leu Ser Gly Phe Phe 
840 845 850 



gtt tec gat gtc tat cgt aac aat ccc eaa tct aca geg act ett gtg 
Val Ser Asp Val Tyr Arg Asn Asn Pro Gin Ser Thr Ala Thr Leu Val 
855 860 865 



2707, 
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atg age cca gac tct tgg aaa att cgc ggt ggc aat ctt tea aga cag 
Met Ser Pro Asp Ser Trp Lys lie Arg Gly Gly Asn Leu Ser Arg Gin 
870 875 880 885 



2755 



gca ttt tta ctg agg ggt age aac aac tac gtc tac aac tec aat tgt 
Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys 
890 895 900 



2803 



gag etc ttc gga cat tac get atg gaa etc cgt gga tct tea agg aac 
Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn 
905 910 915 



2851 



tac aat gta gat gtt ggt ace aaa etc cga ttc tagattgeta aaaetceeta 2904 
Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
920 925 



gttettetag ggagttttct catactttta gggaaatatt tgctat 



2950 
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Figure 2 A: Restriction enzyme analysis of CPNl 00634 



Alul 

Hpyl7 8III CviJI 
Apol |MspAlI 
Tsp50 9I I PvuII Msel Sf aNI 

III I I 

GATAAAAATTCTTGACAGCTGTTTTGTCATCTTTAACTTGATTTACTTATTTTGTTTCTA 

1 + + + + + + 60 

CTATTTTTAAGAACTGTCGACAAAACAGTAGAAATTGAACTAAATGAATAAAACAAAGAT 



BsaJI 
Styl 
MboII 
Hinf I 
Tf il 
Bbsl I 
TaqI I 
I I 



Nlalll 
MslI I 
I I 



TATTGATGCGAATAGTTCTCTAAAAAACAAAAGCATTACCATGAAGACTTCGATTCCTTG 

61 + + + + + + 120 

ATAACTACGCTTATCAAGAGATTTTTTGTTTTCGTAATGGTACTTCTGAAGCTAAGGAAC 



121 



HphI Mae I II Alul 

Alul I Nlalll Taal CviJI 

CviJI |MaeIIl| Tsp45I Mnll | 

Mnll| |Tsp45l| Sfcl I Bfal II HphI 

III M I I III I 

GGTTTTAGTTTCCTCCGTGTTAGCTTTCTCATGTCACCTACAGTCACTAGCTAACGAGGA 

+ + + + + + 

CCAAAATCAAAGGAGGCACAATCGAAAGAGTACAGTGGATGTCAGTGATCGATTGCTCCT 



180 



BsaXI 
Alol I 
Ppil I 
EC057I 
Acll I 
Maell I 
Hpyl7 8III 
Hinfl 

Msel Tfil 
Alul I Clal I 

CviJI I TaqI | 

II II 
ACTTTTATCACCTGATGATAGCTTTAATGGAAATATCGATTCAGGAACGTTTACTCCAAA 

181 + + + + + + 240 

TGAAAATAGTGGACTACTATCGAAATTACCTTTATAGCTAAGTCCTTGCAAATGAGGTTT 
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BslI 
ECONI 
ScrFI I 
CviJI I I 
EcoRII I I 
I I I 



AACTTCAGCCACAACATATTCTCTAACAGGAGATGTCTTCTTTTACGAGCCTGGAAAAGG 

241 + + + + + + 

TTGAAGTCGGTGTTGTATAAGAGATTGTCCTCTACAGAAGAAAATGCTCGGACCTTTTCC 



300 



CjePI BsaJI CjePI 
Hpyl88IX Taal Msel | BstDSI BslI | 

I I II I II 

CACTCCCTTATCTGACAGTTGTTTTAAGCAAACCACGGACAATCTTACCTTCTTGGGGAA 

301 + + + + + + 360 

GTGAGGGAATAGACTGTCAACAAAATTCGTTTGGTGCCTGTTAGAATGGAAGAACCCCTT 



CviRI 
Fnu4HI 
Tsel 
Fnu4HI 
AlwNI 



BstAPI 
Mwol 
Tsel 
Aarl 
MslI 
CviRI 



Acll 
Mae 1 1 
Msel I 
Alul I I CviJI 
Taal CviJI j j Sf aNI | 
I III II 



Nlalll 
Bbvl 
Bbvl I 
BspMI I I 
Cac8l| I I 
II I I 



CGGTCATAGCTTAACGTTTGGCTTTATAGATGCTGGCACTCATGCAGGTGCTGCTGCATC 

361 + + + + + + 420 

GCCAGTATCGAATTGCAAACCGAAATATCTACGACCGTGAGTACGTCCACGACGACGTAG 



BseMII 

Hinfl BslI I Ddel Hinfl 

SfaNI Tfil Ddel | | BseMII | Tfil 

I I III II I 

TACAACAGCAAATAAGAATCTTACCTTCTCAGGGTTTTCCTTACTGAGTTTTGATTCCTC 

421 + + + + + + 480 

ATGTTGTCGTTTATTCTTAGAATGGAAGAGTCCCAAAAGGAATGACTCAAAACTAAGGAG 



Maelll 
Mnll I 
Bfal I Taal Sfcl 

II II 



Mnll 
BbvCI I 
BpulOI I 
XmnI Ddel I 

I M 



Apol 
Tsp509I 
BseMII I 
Mnll I I 

I I I 



TCCTAGCACAACGGTTACTACAGGTCAGGGAACGCTTTCCTCAGCAGGAGGCGTAAATTT 

481 + + + + + + 540 

AGGATCGTGTTGCCAATGATGTCCAGTCCCTTGCGAAAGGAGTCGTCCTCCGCATTTAAA 
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Fig. 2 C- (GQn ^ ) 

Bed 

Apol PstI I Hgal 

Sspl Cjel CviRI I I Alul I 

Xmnl| Tsp509I Sfcl [ | | Cvi JI | 

II I M I I II 

AGAAAATATTCGTAAACTTGTAGTTGCTGGGAATTTTTCTACTGCAGATGGTGGAGCTAT 

541 + + + + + + 600 

TCTTTTATAAGCATTTGAACATCAACGACCCTTAAAAAGATGACGTCTACCACCTCGATA 

Hpyl78III Bpml 
Bsri I MboII I 

Cjel Msel SfaNI j Maelll | | 

I III II I 

CAAAGGAGCGTCTTTCCTTTTAACTGGCACTTCTGGAGATGCTCTTTTTAGTAACAACTC 

601 + + + + + + 660 

GTTTCCTCGCAGAAAGGAAAATTGACCGTGAAGACCTCTACGAGAAAAATCATTGTTGAG 

CacSI 

Mwol Haell I 

Mnll Muni BseRI | Hhal | | 

Earl I Tsp509I Sfcl | CacSI | | | Mwol 

i I I I I I III I 

TTCATCAACAAAGGGAGGAGCAATTGCTACTACAGCAGGCGCTCGCATAGCAAATAACAC 

661 + + + + + + 720 

AAGTAGTTGTTTCCCTCCTCGTTAACGATGATGTCGTCCGCGAGCGTATCGTTTATTGTG 

Clal 

Hgal Hpyl78III TaqI 

BsaBI I Maell | Haell j 

Hinfl I I Mnll I Hhal I | 

Tf il I I AccI I I Hin4I | | | 

III III III I 

AGGTTATGTTAGATTCCTATCTAACATAGCGTCTACGTCAGGAGGCGCTATCGATGATGA 

721 + + + + + + 780 

TCCAATACAATCTAAGGATAGATTGTATCGCAGATGCAGTCCTCCGCGATAGCTACTACT 

BsmI 
Fnu4HI I 
CviRI I 

TaqI Apol Tsel j 

Maell I TaqI Tsp509I CjePI | j 

I I I I I II 

AGGCACGTCGATACTATCGAACAACAAATTTCTATATTTTGAAGGGAATGCAGCGAAAAC 

781 + + + + + + 840 

TCCGTGCAGCTATGATAGCTTGTTGTTTAAAGATATAAAACTTCCCTTACGTCGCTTTTG 
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BsaJI 
Styl 
Bsbl 
CjePI 
CviRI 



Cjel 
Dpnl 
Sau3AI 
Bsri I 
Acil| I 
Bbvl I I I 
I I 



Alwl 
Hpyl7 8III 
Dpnl 
BstYI I 
Hin4I I 
Sau3AI I j j Cjel BplI 

, M , I I II II 

TACTGGCGGTGCGATCTGCAACACCAAGGCGAGTGGATCTCCTGAACTGATAATCTCTAA 

841 + + + + + + 

ATGACCGCCACGCTAGACGTTGTGGTTCCGCTCACCTAGAGGACTTGACTATTAGAGATT 

Bed 
Haell 
Hhal 
NlalV 
BsaHI I 

Cjel Acil Narl I I I I Nlalll 

Mae II I Fokl | BanI | j | j j Mwol | 

II I I III 



900 



Dpnl 
Sau3AI I 
Hpyl88IX| I 
Plel Hinf I I I I 
I 



CAATAAGACTCTGATCTTTGCTTCAAACGTAGCAGAAACAAGCGGTGGCGCCATCCATGC 

901 + + + + + + 

GTTATTCTGAGACTAGAAACGAAGTTTGCATCGTCTTTGTTCGCCACCGCGGTAGGTACG 



960 



CviJI 
Cac8I 
Bf al 
Alul I , 

CviJI I I j Hpyl78III Mnll Pf 111081 

Cjel Nhel I I I Mnll | CviJI | Bpml | BsmAI 

I III I I I II II I 

TAAAAAGCTAGCCCTTTCCTCTGGAGGCTTTACAGAGTTTCTACGAAATAATGTCTCATC 

961 + + + + + + 1020 

ATTTTTCGATCGGGAAAGGAGACCTCCGAAATGTCTCAAAGATGCTTTATTACAGAGTAG 



1021 



BseMII 
Banll 
BsiHKAI 
Bspl286I 
Sad 

Hpyl78III Ddel| 
BSU36I I Alul I I I CviRI 
BSU36I Clal Ddel | Cvi JI | | | Sfcl | 

Ddel BslI SfaNI TaqI Sf aNI | | Mnll|| | BsmAI | 

II I I II I III 

AGCAACTCCTAAGGGGGGTGCTATCAGCATCGATGCCTCAGGAGAGCTCAGTCTTTCTGC 

+ + + + + + 1080 

TCGTTGAGGATTCCCCCCACGATAGTCGTAGCTACGGAGTCCTCTCGAGTCAGAAAGACG 
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BseMII Mspl BsaXI 

PstI BsaWI I Rsal | 

I II I I 

AGAGACAGGAAACATTACCTTTGTAAGAAATACCCTTACAACAACCGGAAGTACCGATAC 

1081 + + + + + + 1140 

TCTCTGTCCTTTGTAATGGAAACATTCTTTATGGGAATGTTGTTGGCCTTCATGGCTATG 
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Figure 3 A: Nucleotide and deduced amino acid sequence of CPNl 00635 
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get 


Thr 


Phe 


Ser 


Gly 


Asn 


Gin 


Ala 


Val 


Ala 


Asn 


Gly 


Gly 


Ala 


He 


Tyr 


Ala 


Thr 


Phe 


Ser 


Gly 


Asn 


Gin 


Ala 


Val 


Ala 


Asn 


Gly 


Gly 


Ala 


He 


Tyr 


Ala 






280 










285 










290 








aag 


aag 


ctt 


aca 


ctg 


get 


tee 


ggg 


ggg 


ggg 


ggg 


ggg 


aat 


cec 


ttt 


tct 


Lys 


Lys 


Leu 


Thr 


Leu 


Ala 


Ser 


Gly 


Gly 


Gly 


Gly 


Gly 


Asn 


Pro 


Phe 


Ser 


Lys 


Lys 


Leu 


Thr 


Leu 


Ala 


Ser 


Gly 


Gly 


Gly 


Gly 


Gly 


Asn 


Pro 


Phe 


Ser 




295 










300 










305 











aac aat ata gte 
Asn Asn He Val 
Asn Asn He Val 
310 



caa ggt acc act 
Gin Gly Thr Thr 
Gin Gly Thr Thr 
315 



gea ggt aat ggt 
Ala Gly Asn Gly 
Ala Gly Asn Gly 
320 



gga gcc att tct 1075 
Gly Ala He Ser 
Gly Ala He Ser 
325 



ata ctg gea get gga gag tgt agt ctt tte age gaa gea ggg gae eat 1123 
He Leu Ala Ala Gly Glu Cys Ser Leu Phe Ser Glu Ala Gly Asp His 
He Leu Ala Ala Gly Glu Cys Ser Leu Phe Ser Glu Ala Gly Asp His 
330 335 340 
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tac ctt aat ggg aat gcc att gtt gca act aca cca caa act aca aaa 

Tyr Leu Asn Gly Asn Ala lie Val Ala Thr Thr Pro Gin Thr Thr Lys 

Tyr Leu Asn Gly Asn Ala lie Val Ala Thr Thr Pro Gin Thr Thr Lys 
345 350 355 



1171 



aga aat tct att gac ata gga tot act ggc aaa gat cac gaa tta cgt 1219 

Arg Asn Ser lie Asp lie Gly Ser Thr Gly Lys Asp His Glu Leu Arg 

Arg Asn Ser lie Asp lie Gly Ser Thr Gly Lys Asp His Glu Leu Arg 

360 365 370 

gca ata tct ggg cat age ate ttt ttc tac gat ccg att act get aat 1267 

Ala lie Ser Gly His Ser lie Phe Phe Tyr Asp Pro lie Thr Ala Asn 

Ala lie Ser Gly His Ser lie Phe Phe Tyr Asp Pro lie Thr Ala Asn 
375 380 385 



acg get gcg gat tct aca gat act tta aat etc aat aag get gat gca 1315 
Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala 
Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala 
390 395 400 405 



ggt aat agt aca gat tat agt ggg teg att gtt ttt tct ggt gaa aag 1363 
Gly Asn Ser Thr Asp Tyr Ser Gly Ser lie Val Phe Ser Gly Glu Lys 
Gly Asn Ser Thr Asp Tyr Ser Gly Ser lie Val Phe Ser Gly Glu Lys 
410 415 420 



etc tct gaa gat gaa gca aaa gtt gca gac aac etc act tct acg ctg 1411 
Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn Leu Thr Ser Thr Leu 
Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn Leu Thr Ser Thr Leu 
425 430 435 

aag cag cet gta act eta act gca gga aat tta gta ctt aaa cgt ggt 1459 

Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu Val Leu Lys Arg Gly 

Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu Val Leu Lys Arg Gly 
440 445 450 



gte act etc gat acg aaa ggc ttt act cag ace gcg ggt tec tct gtt 1507 
Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr Ala Gly Ser Ser Val 
Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr Ala Gly Ser Ser Val 
455 460 465 



att atg gat gcg 
lie Met Asp Ala 
lie Met Asp Ala 
470 



ggc aca acg tta 
Gly Thr Thr Leu 
Gly Thr Thr Leu 
475 



aaa gca agt aca 
Lys Ala Ser Thr 
Lys Ala Ser Thr 
480 



gag gag gte act 1555 
Glu Glu Val Thr 
Glu Glu Val Thr 
485 



tta aca ggt ctt tec att cet gta gac tct tta ggc gag ggt aag aaa 1603 
Leu Thr Gly Leu Ser lie Pro Val Asp Ser Leu Gly Glu Gly Lys Lys 
Leu Thr Gly Leu Ser lie Pro Val Asp Ser Leu Gly Glu Gly Lys Lys 
490 495 500 



gtt gta att get get tct gca gca agt aaa aat gta gee ctt agt ggt 1651 
Val Val lie Ala Ala Ser Ala Ala Ser Lys Asn Val Ala Leu Ser Gly 
Val Val lie Ala Ala Ser Ala Ala Ser Lys Asn Val Ala Leu Ser Gly 
505 510 515 



20/77 



^ig. 3P (con't) 



ANNOTATED SHEET SHOWING CHANGES 

Title: CHLAMYDIA ANTIGENS AND 
CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/857,128 



ccg att ctt ctt ttg gat aac caa ggg aat get tat gaa aat cac gac 
Pro lie Leu Leu Leu Asp Asn Gin Gly Asn Ala Tyr Glu Asn His Asp 
Pro lie Leu Leu Leu Asp Asn Gin Gly Asn Ala Tyr Glu Asn His Asp 
520 525 530 



1699 



tta 


gga 


aaa 


act 


caa 


gac 


ttt 


tea 


ttt 


gtg 


eag 


etc 


tct 


get 


ctg 


ggt 


Leu 


Gly 


Lys 


Thr 


Gin 


Asp 


Phe 


Ser 


Phe 


Val 


Gin 


Leu 


Ser 


Ala 


Leu 


Gly 


Leu 


Gly 


Lys 


Thr 


Gin 


Asp 


Phe 


Ser 


Phe 


Val 


Gin 


Leu 


Ser 


Ala 


Leu 


Gly 




535 










540 










545 











act 


gca 


aca 


act 


aca 


gat 


gtt 


cca 


gcg 


gtt 


cet 


aca 


gta 


gca 


act 


cet 


Thr 


Ala 


Thr 


Thr 


Thr 


Asp 


Val 


Pro 


Ala 


Val 


Pro 


Thr 


Val 


Ala 


Thr 


Pro 


Thr 


Ala 


Thr 


Thr 


Thr 


Asp 


Val 


Pro 


Ala 


Val 


Pro 


Thr 


Val 


Ala 


Thr 


Pro 


550 










555 










560 










565 


acg 


cae 


tat 


ggg 


tat 


caa 


ggt 


act 


tgg 


gga 


atg 


act 


tgg 


gtt 


gat 


gat 


Thr 


His 


Tyr 


Gly 


Tyr 


Gin 


Gly 


Thr 


Trp 


Gly 


Met 


Thr 


Trp 


Val 


Asp 


Asp 


Thr 


His 


Tyr 


Gly 


Tyr 


Gin 


Gly 


Thr 


Trp 


Gly 


Met 


Thr 


Trp 


Val 


Asp 


Asp 










570 










575 










580 





acc gca age act eea aag act aag 
Thr Ala Ser Thr Pro Lys Thr Lys 
Thr Ala Ser Thr Pro Lys Thr Lys 
585 



aca 


gcg 


aca 


tta 


get 


tgg 


ace 


aat 


Thr 


Ala 


Thr 


Leu 


Ala 


Trp 


Thr 


Asn 


Thr 


Ala 


Thr 


Leu 


Ala 


Trp 


Thr 


Asn 


590 










595 







aca 


ggc 


tac 


ctt 


ccg 


aat 


cet 


gag 


egt 


caa 


gga 


cet 


tta 


gtt 


cet 


aat 


Thr 


Gly 


Tyr 


Leu 


Pro 


Asn 


Pro 


Glu 


Arg 


Gin 


Gly 


Pro 


Leu 


Val 


Pro 


Asn 


Thr 


Gly 


Tyr 


Leu 


Pro 


Asn 


Pro 


Glu 


Arg 


Gin 


Gly 


Pro 


Leu 


Val 


Pro 


Asn 






600 










605 










610 









age 


ctt 


tgg 


gga 


Ser 


Leu 


Trp 


Gly 


Ser 


Leu 


Trp 


Gly 




615 






gag 


aga 


agt 


get 


Glu 


Arg 


Ser 


Ala 


Glu 


Arg 


Ser 


Ala 



tct ttt tea gac 
Ser Phe Ser Asp 
Ser Phe Ser Asp 
620 

ttg act. ctt tgt 
Leu Thr Leu Cys 
Leu Thr Leu Cys 
635 



ate caa gcg att 
lie Gin Ala lie 
lie Gin Ala He 
625 

tea gat cga ggc 
Ser Asp Arg Gly 
Ser Asp Arg Gly 
640 



caa ggt gte ata 1987 
Gin Gly Val lie 
Gin Gly Val He 



ttc tgg get gcg 2035 
Phe Trp Ala Ala 
Phe Trp Ala Ala 
645 



gga gte gee aat 
Gly Val Ala Asn 
Gly Val Ala Asn 



ttc tta gat aaa 
Phe Leu Asp Lys 
Phe Leu Asp Lys 
650 



gat aag aaa ggg 
Asp Lys Lys Gly 
Asp Lys Lys Gly 
655 



gaa 


aaa 


ege 


aaa 


Glu 


Lys 


Arg 


Lys 


Glu 


Lys 


Arg 


Lys 






660 





tac egt cat aaa tct ggt gga tat 
Tyr Arg His Lys Ser Gly Gly Tyr 
Tyr Arg His Lys Ser Gly Gly Tyr 
665 



get 


ate 


gga 


ggt gea 


gcg 


caa 


act 


2131 


Ala 


He 


Gly 


Gly Ala 


Ala 


Gin 


Thr 




Ala 


He 


Gly 


Gly Ala 


Ala 


Gin 


Thr 




670 








675 









tgt 


tct 


gaa 


aac 


Cys 


Ser 


Glu 


Asn 


Cys 


Ser 


Glu 


Asn 






680 





tta att age ttt 
Leu He Ser Phe 
Leu He Ser Phe 
685 



gee ttt tge caa 
Ala Phe Cys Gin 
Ala Phe Cys Gin 



etc 


ttt 


ggt 


age 


Leu 


Phe 


Gly 


Ser 


Leu 


Phe 


Gly 


Ser 


690 









m 0 a 2004 ^ 

* ^'r^c^^^g. 3 & (con't) 



gat 
Asp 
Asp 


aaa 
Lys 
Lys 
695 


gat 
Asp 
Asp 


ttc 
Phe 
Phe 


tta 
Leu 
Leu 


gte 
Val 
Val 


get 
Ala 
Ala 
700 


aaa 
Lys 
Lys 


aat 
Asn 
Asn 


gcc 
Ala 
Ala 
710 


ttc 
Phe 
Phe 


tat 
Tyr 
Tyr 


ate 

lie 
He 


eaa 
Gin 
Gin 


eac 
His 
His 
715 


att 
lie 
He 


aca 
Thr 
Thr 


gaa 
Glu 
Glu 


etc 
Leu 
Leu 


tta 
Leu 
Leu 


gat 
Asp 
Asp 


aaa 
Lys 
Lys 


ett 
Leu 
Leu 
730 


cct 
Pro 
Pro 


gge 
Gly 
Gly 


tct 
Ser 
Ser 


tgg 
Trp 
Trp 


gaa 
Glu 
Glu 


ggg 

Gly 
Gly 


cag 
Gin 
Gin 


etc 
Leu 
Leu 
745 


get 
Ala 
Ala 


tat 
Tyr 
Tyr 


age 
Ser 
Ser 


eac 
His 
His 


gtc 
Val 
Val 
750 


tat 
Tyr 
Tyr 


act 
Thr 
Thr 


geg 
Ala 
Ala 
760 


tat 
Tyr 
Tyr 


cct 
Pro 
Pro 


gag 
Glu 
Glu 


gtg 
Val 
Val 


aaa 
Lys 
Lys 
765 


ggt 
Gly 
Gly 


aac 
Asn 
Asn 


atg 
Met 
Met 
775 


atg 
Met 
Met 


ttg 
Leu 
Leu 


gga 
Gly 
Gly 


get 
Ala 
Ala 


tct 
Ser 
Ser 
780 


tct 
Ser 
Ser 


cat 
His 
His 


tgt 
Cys 
Cys 
790 


ttt 

Phe 
Phe 


gat 
Asp 
Asp 


acc 
Thr 
Thr 


tat 
Tyr 
Tyr 


get 
Ala 
Ala 
795 


cca 
Pro 
Pro 


tac 
Tyr 
Tyr 


ate 
He 
He 


cgt 
Arg 
Arg 


cag 
Gin 
Gin 


gac 
Asp 
Asp 


age 
Ser 
Ser 


ttc 
Phe 
Phe 
810 


teg 
Ser 
Ser 


gag 
Glu 
Glu 


aaa 
Lys 
Lys 


ggt 
Gly 
Gly 


gac 
Asp 
Asp 


age 
Ser 
Ser 


aac 
Asn 
Asn 


etc 
Leu 
Leu 
825 


ttc 
Phe 
Phe 


aat 
Asn 
Asn 


tta 
Leu 
Leu 


tct 
Ser 
Ser 


ttg 
Leu 
Leu 
830 


aag 
Lys 
Lys 


ttc 
Phe 
Phe 


tct 
Ser 
Ser 
840 


gat 

Asp 
Asp 


tgt 
Cys 
Cys 


aat 
Asn 
Asn 


gac 
Asp 
Asp 


ttt 
Phe 
Phe 
845 


tct 
Ser 
Ser 


gtt 
Val 
Val 


ect 
Pro 
Pro 
855 


gat 
Asp 
Asp 


ett 
Leu 
Leu 


ate 
He 
He 


ege 
Arg 
Arg 


aat 
Asn 
Asn 
860 


gat 
Asp 
Asp 


cec 
Pro 
Pro 
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cat 
His 
His 


act 
Thr 
Thr 


gat 
Asp 
Asp 
705 


acc 
inr 
Thr 


tat 

Tyr 
Tyr 


gca 
Ala 
Ala 


gga 
vjiy 
Gly 


2227 


tgt 
Cys 
Cys 


agt 
Ser 
Ser 
720 


ggg 

Gly 
Gly 


ttc 

"Oil ^ 

pne 
Phe 


ata 
lie 
He 


ggt 
vjiy 
Gly 


tgt 
Cys 
Cys 
725 


2275 


agt 
Ser 
Ser 
735 


cat 
His 
His 


aaa 
Lys 
Lys 


cec 
Pro 
Pro 


etc 
Leu 
Leu 


gtt 
Val 
Val 
740 


tta 
Leu 
Leu 


2323 


agt 
Ser 
Ser 


aat 
Asn 
Asn 


gat 
Asp 
Asp 


etg 
Leu 
Leu 


aag 
Lys 
Lys 
755 


aca 
Thr 
Thr 


aag 
Lys 
Lys 


2371 


tct 
Ser 
Ser 


tgg 
Trp 
Trp 


ggg 

Gly 
Gly 


aat 
Asn 
Asn 
770 


aat 
Asn 
Asn 


get 
Ala 
Ala 


ttt 
Phe 
Phe 


2419 


tct 
Ser 
Ser 


tat 
Tyr 
Tyr 


ect 
Pro 
Pro 
785 


gaa 
Glu 
Glu 


tac 
Tyr 
Tyr 


etg 
Leu 
Leu 


cat 
His 
His 


2467 


aaa 
Lys 
Lys 


etg 
Leu 
Leu 
800 


aat 
Asn 
Asn 


etg 
Leu 
Leu 


acc 
Thr 
Thr 


tat 
Tyr 
Tyr 


ata 
1 le 
He 
805 


2515 


aca 
Thr 
Thr 
815 


gaa 
Glu 
Glu 


gga 
Gly 
Gly 


aga 
Arg 
Arg 


tct 
Ser 
Ser 


ttt 
Phe 
Phe 
820 


gat 
Asp 
Asp 


2563 


cct 
Pro 
Pro 


ata 
lie 
He 


ggg 

Gly 
Gly 


gtg 
Val 
Val 


aag 
Lys 
Lys 
835 


ttt 
pne 
Phe 


gag 
Glu 
Glu 


2611 


tat 
Tyr 
Tyr 


gat 
Asp 
Asp 


etg 
Leu 
Leu 


act 
Thr 
Thr 
850 


tta 
Leu 
Leu 


tec 
Ser 
Ser 


tat 
Tyr 
Tyr 


2659 


aaa 
Lys 
Lys 


tgc 
Cys 
Cys 


act 
Thr 
Thr 


aca 
Thr 
Thr 


gca 
Ala 
Ala 


ett 
Leu 
Leu 


gta 
Val 
Val 


2707 



865 
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ate 


age 


gga 


gcc 


tct 


tgg 


gaa 


act 


tat 


gcc 


aat 


aac 


tta 


gca 


cga 


cag 


He 


Ser 


Gly 


Ala 


Ser 


Trp 


Glu 


Thr 


Tyr 


Ala 


Asn 


Asn 


Leu 


Ala 


Arg 


Gin 


He 


Ser 


Gly 


Ala 


Ser 


Trp 


Glu 


Thr 


Tyr 


Ala 


Asn 


Asn 


Leu 


Ala 


Arg 


Gin 


870 










875 










880 










885 



gcc ttg caa gtg cgt gca ggc agt cac tac gcc ttc tct cct atg ttt 2803 
Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala Phe Ser Pro Met Phe 
Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala Phe Ser Pro Met Phe 
890 895 900 



gaa gtg etc ggc cag ttt gtc ttt gaa gtt cgt gga tec tea egg att 2851 
Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg Gly Ser Ser Arg He 
Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg Gly Ser Ser Arg He 
905 910 915 



tat 


aat 


gta 


gat 


ctt 


ggg 


ggt 


aag 


ttc 


caa 


ttc 


Tyr 


Asn 


Val 


Asp 


Leu 


Gly 


Gly 


Lys 


Phe 


Gin 


Phe 


Tyr 


Asn 


Val 


Asp 


Leu 


Gly 


Gly 


Lys 


Phe 


Gin 


Phe 






920 










925 









cagaaattet gagagagate geatttagga ttttettaaa cacgae 



2950 



<5N 
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Figure 4 A: Restriction enzyme analysis of CPN 100635 



Apol 
Tsp509I 
Hpyl7 8III 
Alul 
CviJI 
MspAlI 
PvuII 



Ddel 
Hpyl8 8IX 
Hinf I 
Hpyl78III I 
BseMIll I 

Msel PvuII I I Plel I I I 

I II I 

AGTTAAAGATGACAAAACAGCTGTCAAGAATTTTTATCTTGACTCTCTGAGTTTTCTATT 

1 + + + + + + 60 

TCAATTTCTACTGTTTTGTCGACAGTTCTTAAAAATAGAACTGAGAGACTCAAAAGATAA 



ApoI ScrFI 
Tsp509I EcoRII I 

Hgal |MseI Tsp509I | j 

III III 
TTATATGACGCAAGTAAGAATTTAATAATAAAGTGGGTTTATGAAATCGCAATTTTCCTG 

61 + + + + + + 120 

AATATACTGCGTTCATTCTTAAATTATTATTTCACCCAAATACTTTAGCGTTAAAAGGAC 



BsiHKAI 
Bspl286I 
Mboll I 



Earl Nlalll Bfal 

TaqI I MslI Nspl Spel| Bbvl 

I I I I I I II I 

GTTAGTGCTCTCTTCGACATTGGCATGTTTTACTAGTTGTTCCACTGTTTTTGCTGCAAC 

121 + + + + + + 180 

CAATCACGAGAGAAGCTGTAACCGTACAAAATGATCAACAAGGTGACAAAAACGACGTTG 



Mwol 
CviRI 
Fnu4HI I 
TspRI I I 

Taal |Tsel| | 
II III 



181 



Hpyl88IX 
NlalV 
CviJI I 

Haelllj i Mnll Rsal 

Eco0109l|| I Alul I Seal 

Sau96lj| I CviJI I TatI | 

III I II II III 
TGCTGAAAATATAGGCCCCTCTGATAGCTTTGACGGAAGTACTAACACAGGCACCTATAC 
+ + + + + + 

ACGACTTTTATATCCGGGGAGACTATCGAAACTGCCTTCATGATTGTGTCCGTGGATATG 



NlalV 
BanI iBsaXI 



240 



Cjel 

Cjel BsrI Hpyl88IX CviRI 

I II I 

TCCTAAAAATACGACTACTGGAATAGACTATACTCTGACAGGAGATATAACTCTGCAAAA 

241 + + + + + + 

AGGATTTTTATGCTGATGACCTTATCTGATATGAGACTGTCCTCTATATTGAGACGTTTT 



300 



Fig. 4<& (-Gon 't) 
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Msel 
Alul 
CviJI 

BsaJI Hinfl Fnu4HI | 
Styl Tfil Tsel| | 
I I II I 



Alul 
CviJI 

Hinfl Hindlll | 
Bbvl Hpyl88IX Tfil Msel | | 

I I I I II 

CCTTGGGGATTCGGCAGCTTTAACGAAGGGTTGTTTTTCTGACACTACGGAATCTTTAAG 

301 + + + + + + 

GGAACCCCTAAGCCGTCGAAATTGCTTCCCAACAAAAAGACTGTGATGCCTTAGAAATTC 



360 



Fnu4HI 
Hhal I 

Mspl Dral Msel Tsel | 

BsrFI I BslI Rsal Msel|SspI | Bfal Mwol || 

II I I II II I I II 

CTTTGCCGGTAAGGGGTACTCACTTTCTTTTTTAAATATTAAGTCTAGTGCTGAAGGCGC 

361 + + + + + + 

GAAACGGCCATTCCCCATGAGTGAAAGAAAAAATTTATAATTCAGATCACGACTTCCGCG 



420 



EC057I 

Bbvl I Hinfl 
CviJI Maelll | | BsaBI TaqI | Plel 

I III I Ml 

AGCCCTTTCTGTTACAACTGATAAAAATCTGTCGCTAACAGGATTTTCGAGTCTTACTTT 

421 + + + + + + 480 

TCGGGAAAGACAATGTTGACTATTTTTAGACAGCGATTGTCCTAAAAGCTCAGAATGAAA 



Bed 
NlalV 
CviJI 
Haelll 
Sau96I 
Fnu4HI 
Taul 



Acil I 
Cjel I I 
Ddel I I I 
II II 



481 



Hpyl78III 
Cjel I 
BSU36I I I 
BslI Ddel I I 

I I II 
CTTAGCGGCCCCATCATCGGTAATCACAACCCCCTCAGGAAAAGGTGCAGTTAAATGTGG 
+ + + + + + 

GAATCGCCGGGGTAGTAGCCATTAGTGTTGGGGGAGTCCTTTTCCACGTCAATTTACACC 



Mnll 
Msel 
BseMII I 
CviRI I I 
Mnll I I I 
I III 



540 



Bsgl 
Dpnl 

BstYI I Dral Taal 

SauSAI I Alwl Msel I Mnll | 

III II I I 

AGGGGATCTTACATTTGATAACAATGGAACTATTTTATTTAAACAAGATTACTGTGAGGA 
541 + + + + + + 

TCCCCTAGAATGTAAACTATTGTTACCTTGATAAAATAAATTTGTTCTAATGACACTCCT 



600 



Fig. 4C (oon ' t) 
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CviJI 

Nlaiv| 
Ecil I I 
Acill II 
II II 



Alwl 



601 



Hinf I 
Tfil Sthl32I 

I I 

AAATGGCGGAGCCATTTCTACCAAGAATCTTTCTTTGAAAAACAGCACGGGATCGATTTC 

+ + + + + + 

TTTACCGCCTCGGTAAAGATGGTTCTTAGAAAGAAACTTTTTGTCGTGCCCTAGCTAAAG 



Clal 

Taqi 
Dpnl I 
Sau3AI I I 
BscGI I I I 
II II 



660 



661 



Hhal 

CjePl| CjePI 
TaqI I I CviJI Mwol | 

I II III 

TTTTGAAGGGAATAAATCGAGCGCAACAGGGAAAAAAGGTGGGGCTATTTGTGCTACTGG 

+ + + + + + 

AAAACTTCCCTTATTTAGCTCGCGTTGTCCCTTTTTTCCACCCCGATAAACACGATGACC 



720 



Taal 
Sf Cl I 
Bsri I I 
Rsal I I 
I II 



MboII 
NlaIV| 
CviJI'.| I 
III 



BspMI 
Sspl 



Bbvl 
Mnll 
Earl 
Hpyl78III 
TaqI 
Bcefl I 
I I 



Fnu4HI 
Sfcl 
Alul 
CviJI 
Mwol 
Tsel 
I 



TACTGTAGATATTACAAATAATACGGCTCCTACCCTCTTCTCGAACAATATTGCTGAAGC 

721 + + + + + + 

ATGACATCTATAATGTTTATTATGCCGAGGATGGGAGAAGAGCTTGTTATAACGACTTCG 



780 



EC057I 

Alul I 

CviJI I Tsp509I 

Mwol I I Rsal 

Aarl I I I BsrGI | 

Pstlj I I I Taal I I BsmAI 

CviRI I I I I I TatI I I Maell BsmBI 

I II I I I III I I 

TGCAGGTGGAGCTATAAATAGCACAGGAAACTGTACAATTACAGGGAATACGTCTCTTGT 

781 + + + + + + 840 

ACGTCCACCTCGATATTTATCGTGTCCTTTGACATGTTAATGTCCCTTATGCAGAGAACA 



Hpyl88IX 



Maelll 
Tsp4 5I 



Acil 
BsiEI Mnll 



BseRI 
Hpyl78III 
Banll 
BsiHKAI 
Bspl286I 
Sad 
Sf aNI 
Alul I 
CviJI I I jCjel 

II III 



ATTTTCTGAAAATAGTGTGACAGCGACCGCAGGAAATGGAGGAGCTCTTTCTGGAGATGC 

841 + + + + + + 900 

TAAAAGACTTTTATCACACTGTCGCTGGCGTCCTTTACCTCCTCGAGAAAGACCTCTACG 
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Fig. 4 0 (con*t) 



Maelll 



Maelll 
Hpyl88IX 
Hinfl I 
BslI I I 
Bpml |TfiI I 
I I I I 



Hpyl78III 
Cjel Ddel I 
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BseMII 
Sf Cl 
Alul I Alul 
Cvi JI Cvi JI 



I 



CGATGTTACCATATCTGGGAATCAGAGTGTAACTTTCTCAGGAAACCAAGCTGTAGCTAA 

901 + + + + + + 960 

GCTACAATGGTATAGACCCTTAGTCTCACATTGAAAGAGTCCTTTGGTTCGACATCGATT 



Neil 
ScrFI 
BsaJI 
Mspl 
BsrI 
TspRI 
CviJI 



Sthl32I 
Alul 

CviJI CviJI 
NlaIV| Hindlll | 

Ecil I I Ddel I I 

Acil I I I Mwol I I I 

MM II II 



BslI 
I 



Hinfl 
Tf il 



TGGCGGAGCCATTTATGCTAAGAAGCTTACACTGGCTTCCGGGGGGGGGGGGGGGAATCC 

961 + + + + + + 1020 

ACCGCCTCGGTAAATACGATTCTTCGAATGTGACCGAAGGCCCCCCCCCCCCCCCTTAGG 



PstI 
TspRI 
CviRI 
HgiEII 
BtsI 
Sf cl 
Kpnl 
NlalV 



Rsal 
BanI 
BspMI 
BsaJI I 
Styl I 
Cjel I I 
I I I 



CviJI 
Nlaiv| 
I I 



Xcml 
Cjel I 



CTTTTCTAACAATATAGTCCAAGGTACCACTGCAGGTAATGGTGGAGCCATTTCTATACT 

1021 + + + + + + 1080 

GAAAAGATTGTTATATCAGGTTCCATGGTGACGTCCATTACCACCTCGGTAAAGATATGA 



0 8 2004 



^ Fig. 4 £4€©Prt)- 
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Alul 
CviJI 
MspAlI 
PvuII 
Bsri I 
Fnu4HI 1 
Tsel| 1 

I 



Bbvl 



Bpml 



NlalV 
Avail I 
Sau96I I 
I I 



BsmFI 
Msel I 
Xcml I I 
II I 



GGCAGCTGGAGAGTGTAGTCTTTTCAGCGAAGCAGGGGACCATTACCTTAATGGGAATGC 

1081 + + + + + + 1140 

CCGTCGACCTCTCACATCAGAAAAGTCGCTTCGTCCCCTGGTAATGGAATTACCCTTACG 

Dpnl 
BstYI 

CviRI Sau3AI 
Mwol I Apol HaelV I I Bcgl 

BsmI I I Tsp509I Hin4I | | Alwl | 

III I II 

CATTGTTGCAACTACACCACAAACTACAAAAAGAAATTCTATTGACATAGGATCTACTGG 

1141 + + + + + + 1200 

GTAACAACGTTGATGTGGTGTTTGATGTTTTTCTTTAAGATAACTGTATCCTAGATGACC 



Hpyl8 8IX 
Dpnl 
Sau3AI 

CviRI Pf 111081 I 

BsaAI I SfaNI I I 

Maelll I Bcgl Alwl | || | | Bbvl 

III I II II I I I 

CAAAGATCACGAATTACGTGCAATATCTGGGCATAGCATCTTTTTCTACGATCCGATTAC 

1201 + + + + + + 

GTTTCTAGTGCTTAATGCACGTTATAGACCCGTATCGTAGAAAAAGATGCTAGGCTAATG 



Tsp509I 
Hpyl78III 
Dpnl I 
Sau3AI I I 
BsrI I I I 
I I I I 



1260 



1261 



Hinf I 
Tf il 
Acil 
Fnu4HI I 

CviJI I I j CviJI 
Tsel I I I Beef I Dral BspMI | 

Mwol II I jsfcl I Msel I SfaNI | j CviRI 

I III I I I II III I 

TGCTAATACGGCTGCGGATTCTACAGATACTTTAAATCTCAATAAGGCTGATGCAGGTAA 

+ + + + + + 1320 

ACGATTATGCCGACGCCTAAGATGTCTATGAAATTTAGAGTTATTCCGACTACGTCCATT 



Hpyl8 8IX 
HphI I 

Rsal TaqI Alul | j 

TatI I SimI I CviJI | j 

II II III 

TAGTACAGATTATAGTGGGTCGATTGTTTTTTCTGGTGAAAAGCTCTCTGAAGATGAAGC 

1321 + + + + + + 1380 

ATCATGTCTAATATCACCCAGCTAACAAAAAAGACCACTTTTCGAGAGACTTCTACTTCG 



Fig. A ^^GQP^ 



EC057I 
CviRI 
BstAPI I 
Mwol I 
MboII I I 
I I 
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Mnll 



Maelll 
CviJI 
Fnu4HI I 
Mwol I i 
Tselj I 
II I 



ANNOTATED SHEET SHOWING CHANGES 

Title: CHLAMYDIA ANTIGENS AND 
CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/857,128 



Apol 
Tsp509I 
PstI 
CviRI 
Sf Cl I 
Eco57l| I 
Bbvl I I I 
II I 



I I II I II I I I II I I I 

AAAAGTTGCAGACAACCTCACTTCTACGCTGAAGCAGCCTGTAACTCTAACTGCAGGAAA 

1 + + + + + + 

TTTTCAACGTCTGTTGGAGTGAAGATGCGACTTCGTCGGACATTGAGATTGACGTCCTTT 



BseMII 
NlalV 

SacII 

Acil 



Mae 1 1 
Msel I 
Rsal I 1 
Seal I 
Tat I I I 



Hpyl78III 
Maelll I 
Tsp45I TaqI 

I I 



MspAlI 
Thai 
Acil 
BsaJI 
BstDSI 
Hpyl8 8IX 
Faul 
Ddel I 
Sthl32I I 
CviJI I I 

I I I 



TTTAGTACTTAAACGTGGTGTCACTCTCGATACGAAAGGCTTTACTCAGACCGCGGGTTC 

1441 + + + + + + 1500 

AAATCATGAATTTGCACCACAGTGAGAGCTATGCTTTCCGAAATGAGTCTGGCGCCCAAG 



Faul 
Mnll 
Sthl32I I 
SfaNI I I 

I II 



Msel 
Acll 
Fokl 
Mae 1 1 
Bspl286I 
Bmgl 
BseSI 
Cac8I I 
Acil I I 
I I I 



Mnll 
Rsal 
TatI I 
Mnll I I 
II I 



Maelll 
Tsp45I 



BseRI 
Msel I 



CTCTGTTATTATGGATGCGGGCACAACGTTAAAAGCAAGTACAGAGGAGGTCACTTTAAC 

1501 + + + + + + 1560 

GAGACAATAATACCTACGCCCGTGTTGCAATTTTCGTTCATGTCTCCTCCAGTGAAATTG 



Hinf I 
ACCI I 

Sfcl I I Fnu4HI Sfcl 

Plel| I I Mnll Bbvl Tsp509I Tsel | Mwol | 

II II I I I II II 

AGGTCTTTCCATTCCTGTAGACTCTTTAGGCGAGGGTAAGAAAGTTGTAATTGCTGCTTC 

1561 + + + + + + 

TCCAGAAAGGTAAGGACATCTGAGAAATCCGCTCCCATTCTTTCAACATTAACGACGAAG 



1620 
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Hinf I 

PstI Tfil 

Fnu4HI I Hpyl88IX 

CviRI I I Avail I 

Mwoljl Ddel MboII I I BsaJI 

Tsel Bbvl CviJI | Sau96I j | Styl 

III I I I I M I 

TGCAGCAAGTAAAAATGTAGCCCTTAGTGGTCCGATTCTTCTTTTGGATAACCAAGGGAA 

1621 + + + + + + 

ACGTCGTTCATTTTTACATCGGGAATCACCAGGCTAAGAAGAAAACCTATTGGTTCCCTT 



1680 



Alul 
CviJI 
Fnu4HI I 

Hpyl7 8III CviRI I j 

Smll I Tsel I j 

II III 
TGCTTATGAAAATCACGACTTAGGAAAAACTCAAGACTTTTCATTTGTGCAGCTCTCTGC 

1681 + + + + + + 1740 

ACGAATACTTTTAGTGCTGAATCCTTTTTGAGTTCTGAAAAGTAAACACGTCGAGAGACG 



BsmI 



Ddel 
Bce83I I 
Hpyl78III I 
I 



Sfcl 
NlalV I 
Acil I I Cjel 
Sfcl MspAlI I I Taal | 

I I I I I I 

TCTGGGTACTGCAACAACTACAGATGTTCCAGCGGTTCCTACAGTAGCAACTCCTACGCA 

1741 + + + + + + 1800 

AGACCCATGACGTTGTTGATGTCTACAAGGTCGCCAAGGATGTCATCGTTGAGGATGCGT 



CviRI 

Acelll Bsgl I 
Bbvl I Rsal I I 

M II 



Cjel 



Rsal 



Cac8I 
Acil I 



I 



CTATGGGTATCAAGGTACTTGGGGAATGACTTGGGTTGATGATACCGCAAGCACTCCAAA 

1801 + + + + + + 1860 

GATACCCATAGTTCCATGAACCCCTTACTGAACCCAACTACTATGGCGTTCGTGAGGTTT 



Avail 
Sau96I 
Alul I 
CviJI I 
Mwol I I 
I I I 



BpulOI 
Ddel 
Hpyl78III 
Hinf I 
Tfil 
Hgal 



Tthlllll 

Ddel I Mwol I I CviJI 

II 

GACTAAGACAGCGACATTAGCTTGGACCAATACAGGCTACCTTCCGAATCCTGAGCGTCA 

1861 + + + + + + 1920 

CTGATTCTGTCGCTGTAATCGAACCTGGTTATGTCCGATGGAAGGCTTAGGACTCGCAGT 



Hpyl8 8IX 
BseMII I 

I I 
I I 



, MAR 0 8 201H 
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Avail - 

EC00109I Dpnl 

PspSII BstYI I 

Sau96I Fokl |Hpyl88IX Hinfl 

Sse8647I Alol CviJI Sau3AI | Alwl | Tfil 

II I I I I I I 

AGGACCTTTAGTTCCTAATAGCCTTTGGGGATCTTTTTCAGACATCCAAGCGATTCAAGG 

1921 + + + + + + 1980 

TCCTGGAAATCAAGGATTATCGGAAACCCCTAGAAAAAGTCTGTAGGTTCGCTAAGTTCC 



Mwol 
Hinfl 
Acil 
Fnu4HI 
CviJI 
Faul I 



Bbvl 
Taqi 
Dpnl I 
Sau3AI I 
HpylBBIXl I 
Plel Hinfl Mnll | | | 

II III! 
TGTCATAGAGAGAAGTGCTTTGACTCTTTGTTCAGATCGAGGCTTCTGGGCTGCGGGAGT 

1981 + + + + + + 2040 

ACAGTATCTCTCTTCACGAAACTGAGAAACAAGTCTAGCTCCGAAGACCCGACGCCCTCA 



Sthl32l| I 
CviJI I |TseI 
III I 



Plel 

Tsp509l| Ddel Taal 

III I 

CGCCAATTTCTTAGATAAAGATAAGAAAGGGGAAAAACGCAAATACCGTCATAAATCTGG 



2041 



-+ 2100 



GCGGTTAAAGAATCTATTTCTATTCTTTCCCCTTTTTGCGTTTATGGCAGTATTTAGACC 



HpylSSIX 
Mnll I 
HaeIV| I 
Hin4l| I 
II I 



Tsp509I 

Fnu4HI Bsgl I 

CviRI I Hpyl88IX | | Alul 

Tsel|HhaI Bbvl | | Msel | CviJI 
II I I I I II I 



TGGATATGCTATCGGAGGTGCAGCGCAAACTTGTTCTGAAAACTTAATTAGCTTTGCCTT 



2101 



-+ 2160 



ACCTATACGATAGCCTCCACGTCGCGTTTGAACAAGACTTTTGAATTAATCGAAACGGAA 
BstXI Ddel 

I I 

TTGCCAACTCTTTGGTAGCGATAAAGATTTCTTAGTCGCTAAAAATCATACTGATACCTA 

2161 + + + + + + 2220 

AACGGTTGAGAAACCATCGCTATTTCTAAAGAATCAGCGATTTTTAGTATGACTATGGAT 



CviJI 

NlaIV| BsmAI 
CviRI I I Bsbl Mmel Ddel | 

I II I I II 

TGCAGGAGCCTTCTATATCCAACACATTACAGAATGTAGTGGGTTCATAGGTTGTCTCTT 

2221 + + + + + + 2280 

ACGTCCTCGGAAGATATAGGTTGTGTAATGTCTTACATCACCCAAGTATCCAACAGAGAA 



31/77 



Fig.4l {con't) 



ANNOTATED SHEET SHOWING CHANGES 

Title: CHLAMYDIA ANTIGENS AND 
CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. 
Appl. No.: 09/857,128 



Cac8I 
Alul 

CviJI CviJI 
ScrFI I Fnu4HI | 

EcoRII I I Hinfl Plel Mnll Tsel | | 

III I I I III 

AGATAAACTTCCTGGCTCTTGGAGTCATAAACCCCTCGTTTTAGAAGGGCAGCTCGCTTA 

2281 - + + + + + + 2340 

TCTATTTGAAGGACCGAGAACCTCAGTATTTGGGGAGCAAAATCTTCCCGTCGAGCGAAT 



Mae 1 1 
CviJI I 
Acellll I 
Bbvl I 



BSU36I 
Ddel 
Hpyl78III 
EC057I 
BseMII 
MboII 



Hpyl88IX BstZ17l| 
Dpnl I AccI I I 

SauSAI I f Bbsl III I Mnll I | BciVI 

III I ill I III I II I I 

TAGCCACGTCAGTAATGATCTGAAGACAAAGTATACTGCGTATCCTGAGGTGAAAGGTTC 

2341 + + + + + + 2400 

ATCGGTGCAGTCATTACTAGACTTCTGTTTCATATGACGCATAGGACTCCACTTTCCAAG 



MboII Alul 

HphI Msel Nlalll | CviJI Hpyl78III 

I I I I I I 

TTGGGGGAATAATGCTTTTAACATGATGTTGGGAGCTTCTTCTCATTCTTATCCTGAATA 

2401 --. + + + + + + 2460 

AACCCCCTTATTACGAAAATTGTACTACAACCCTCGAAGAAGAGTAAGAATAGGACTTAT 

Hpyl88IX 

BspMI Hinfl I Hpyl78III 

CviRI 1 BsaXI Tfil | Maell | 

II I II II 

CCTGCATTGTTTTGATACCTATGCTCCATACATCAAACTGAATCTGACCTATATACGTCA 

2461 + + + + + + 2520 

GGACGTAACAAAACTATGGATACGAGGTATGTAGTTTGACTTAGACTGGATATATGCAGT 

HpylSBIX Dpnl 

Alul I Bglll I 

CviJI I BstYI I MboII Earl 

Hin4I I Rsal Sau3AI | MboII | Tsp509I | 

II I I I I I II 

GGACAGCTTCTCGGAGAAAGGTACAGAAGGAAGATCTTTTGATGACAGCAACCTCTTCAA 

2521 + + + + + + 2580 

CCTGTCGAAGAGCCTCTTTCCATGTCTTCCTTCTAGAAAACTACTGTCGTTGGAGAAGTT 



Hpyl88IX 
Hin4I I 

Mnll Sfcl HphI I I BplI 

I I III I . 

TTTATCTTTGCCTATAGGGGTGAAGTTTGAGAAGTTCTCTGATTGTAATGACTTTTCTTA 

2581 + + + + + + 2640 

AAATAGAAACGGATATCCCCACTTCAAACTCTTCAAGAGACTAACATTACTGAAAAGAAT 
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BsrDI 
Dpnl I 

Hpyl8 8IX Dpnl Sau3AI | | 

Dpnl I Sau3AI | Alwl | | | Sfcl 

Sau3AI I I Hpyl78IIl| j Acil | | || CviRI | 

Ml III Mill II 

TGATCTGACTTTATCCTATGTTCCTGATCTTATCCGCAATGATCCCAAATGCACTACAGC 

2641 + + + + + + 2700 

ACTAGACTGAAATAGGATACAAGGACTAGAATAGGCGTTACTAGGGTTTACGTGATGTCG 



CviJI 

CviJI Hael 
NlaIV| Haelll 
Acil I I StuI 
MspAlI I I Mnll Ddel Mwol | 

I II I I II 

ACTTGTAATCAGCGGAGCCTCTTGGGAAACTTATGCCAATAACTTAGCACGACAGGCCTT 

2701 + + + + + + 2760 

TGAACATTAGTCGCCTCGGAGAACCCTTTGAATACGGTTATTGAATCGTGCTGTCCGGAA 



CviRI 



Maelll 
Tsp45I 
CacBI I 
CviRI I I 
Cac8I I I j 
III I 



Bsri 
CviJI 
Haelll 
Eael 
Gdill 
BsiHKAI j 
Bspl286I 



Bsgl 
Tthllllll 

II II 
GCAAGTGCGTGCAGGCAGTCACTACGCCTTCTCTCCTATGTTTGAAGTGCTCGGCCAGTT 

2761 + + + + + + 2820 

CGTTCACGCACGTCCGTCAGTGATGCGGAAGAGAGGATACAAACTTCACGAGCCGGTCAA 



Dpnl 
NlalV 

BamHI I Dpnl 

BstYI I Bglll I 

SauSAI I BstYI I Hgal 

Alwl I I Alwl Mnll Sau3AI j Tsp509I 

I I I I I 11 I 

TGTCTTTGAAGTTCGTGGATCCTCACGGATTTATAATGTAGATCTTGGGGGTAAGTTCCA 

2821 + + + + + + 2880 

ACAGAAACTTCAAGCACCTAGGAGTGCCTAAATATTACATCTAGAACCCCCATTCAAGGT 
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Dpnl 
Sau3AI 
BseMII 
Ddel 
Hpyl8 8IX 
BplI 



Bfal 
BslI 



Apol 
Tsp509I 
BsmAI 
Hpyl88IX 
BseMII 
Ddel 
Nlalll 
BsmAI 1 
BsmBI I 
Hin4I I i 

I I I 



Msel 



I 



ATTCTAGGAGCGTCTCTCATGTCTCAGAAATTCTGAGAGAGATCGCATTTAGGATTTTCT 

2881 + + + + + + 2940 

TAAGATCCTCGCAGAGAGTACAGAGTCTTTAAGACTCTCTCTAGCGTAAATCCTAAAAGA 



TAAACACGAC 

2941 + 2950 

ATTTGTGCTG 



^1 p: 



o MAR 0 B 
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Figure 5 Ar: Nucleotide and deduced amino acid sequence of CPN100638 



tgtagattct taacttacta gtctctcctt tcctcttgct ttctttaatt tattgcagta 60 

tgtggtgaaa taatttgtta aaccacctat agccctctac atg aaa tec tct ctt 115 

Met Lys Ser Ser Leu 
1 5 

cat tgg ttt tta ate teg tea tet tta gca ctt ecc ttg tea eta aat 163 
His Trp Phe Leu lie Ser Ser Ser Leu Ala Leu Pro Leu Ser Leu Asn 
10 15 20 

ttc tct gcg ttt get get gtt gtt gaa ate aat eta gga cet aec aat 211 
Phe Ser Ala Phe Ala Ala Val Val Glu lie Asn Leu Gly Pro Thr Asn 
25 30 35 

age ttc tct gga cca gga aec tac act cet cca gee caa aca aca aat 259 
Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro Ala Gin Thr Thr Asn 
40 45 SO 

gca gat gga act ate tat aat eta aca ggg gat gtc tea ate ace aat 307 
Ala Asp Gly Thr lie Tyr Asn Leu Thr Gly Asp Val Ser lie Thr Asn 
55 60 65 

gca gga tet ccg aca get eta aec get tec tge ttt aaa gaa act act 355 
Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys Phe Lys Glu Thr Thr 
70 75 80 85 

ggg aat ctt tet ttc caa ggc cac ggc tac caa ttt etc eta caa aat 403 
Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin Phe Leu Leu Gin Asn 
90 95 • ' 100 

ate gat gcg gga gcg aac tgt aec ttt ace aat aca get gca aat aag 451 
lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn Thr Ala Ala Asn Lys 
105 110 115 

ctt etc tec ttt tea gga ttc tec tat ttg tea eta ata caa ace aeg 499 
Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser Leu lie Gin Thr Thr 
120 125 130 

aat get aec aca gga aca gga gee ate aag tec aca gga get tgt tet 547 
Asn Ala Thr Thr Gly Thr Gly Ala lie Lys Ser Thr Gly Ala Cys Ser 
135 140 145 

att eag teg aac tat agt tge tac ttt ggc caa aac ttt tct aat gac 595 
lie Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin Asn Phe Ser Asn Asp 
150 155 160 165 

aat gga ggc gee etc caa ggc age tct ate agt eta teg eta aac ecc 643 
Asn Gly Gly Ala Leu Gin Gly Ser Ser lie Ser Leu Ser Leu Asn Pro 
170 175 180 



aac eta acg ttt gee aaa aac aaa gca aeg caa aaa ggg ggt gee etc 691 
Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin Lys Gly Gly Ala Leu 
185 190 195 



Mar q g 
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tat tec acg gga ggg att aca att aac aat acg tta aac tea gca tea 73 9 
Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr Leu Asn Ser Ala Ser 
200 205 210 

ttt tct gaa aat acc gcg gcg aac aat ggc gga gcc att tac acg gaa 787 
Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly Ala lie Tyr Thr Glu 
215 220 225 

get age agt ttt att age age aae aaa gca att age ttt ata aac aat 835 
Ala Ser Ser Phe lie Ser Ser Asn Lys Ala lie Ser Phe lie Asn Asn 
230 235 240 245 

agt gtg acc gca acc tea get aca ggg gga gcc att tac tgt agt agt 883 
Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala lie Tyr Cys Ser Ser 
250 255 260 

aca tea gcc ccc aaa cca gtc tta act eta tea gac aae ggg gaa ctg 931 
Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser Asp Asn Gly Glu Leu 
265 270 275 

aac ttt ata gga aat aca gca att act agt ggt ggg gcg att tat act 979 
Asn Phe lie Gly Asn Thr Ala lie Thr Ser Gly Gly Ala lie Tyr Thr 
280 285 290 

gac aat eta gtt ctt tct tet gga gga cet acg ctt ttt aaa aae aac 1027 
Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr Leu Phe Lys Asn Asn 
295 300 305 

tct ggc tat gat act gca get ccc tta gga gga gca att gcg att get 1075 
Ser Gly Tyr Asp Thr Ala Ala Pro Leu Gly Gly Ala lie Ala lie Ala 
310 315 320 325 



gac tct gga tct ttg agt ctt teg get ctt ggt gga gac ate act ttt 
Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly Gly Asp lie Thr Phe 
330 335 340 



1123 



gaa gga aac aca gta gtc aaa gga get tet teg agt cag acc act acc 1171 
Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser Ser Gin Thr Thr Thr 
345 350 355 

aga aat tct att aae ate gga aac acc aat get aag att gta cag ctg 1219 
Arg Asn Ser lie Asn lie Gly Asn Thr Asn Ala Lys lie Val Gin Leu 
360 365 370 

ega gee tet caa ggc aat act ate tac tte tat gat ect ata aca act 1267 
Arg Ala Ser Gin Gly Asn Thr lie Tyr Phe Tyr Asp Pro lie Thr Thr 
375 380 385 

age ate act gca get etc tea gat get eta aac tta aat ggt ect gac 1315 
Ser lie Thr Ala Ala Leu Ser Asp Ala Leu Asn Leu Asn Gly Pro Asp 
390 395 400 405 



ctt gca ggg aat ect gca tat caa gga acc ate gta ttt tet gga gag 
Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr lie Val Phe Ser Gly Glu 
410 415 420 



1363 



ig. 5C-(€efrt^ 



36/77 



ANNOTATED SHEET SHOWING CHANGES 

Title: CHLAMYDIA ANTIGENS AND 
CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 
Inventor(s): Andrew D. MURDIN et al. 
Appl.No.: 09/857,128 



aag etc teg gaa gca gaa get gca gaa get gat aat ete aaa tct aca 
Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp Asn Leu Lys Ser Thr 
425 430 435 



1411 



att cag caa cct eta act ctt gcg gga ggg caa etc tct ctt aaa tea 
lie Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin Leu Ser Leu Lys Ser 
440 445 450 



1459 



gga gte act eta gtt get aag tec ttt teg caa tct ccg ggc tct acc 
Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin Ser Pro Gly Ser Thr 
455 460 465 



1507 



etc ete atg gat gca ggg acc aca tta gaa acc get gat ggg ate act 
Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr Ala Asp Gly lie Thr 
470 475 480 485 



1555 



ate aat aat ctt gtt etc aat gta gat tec tta aaa gag acc aag aag 
lie Asn Asn Leu Val Leu Asn Val Asp Ser Leu Lys Glu Thr Lys Lys 
490 495 500 



1603 



ggc acg eta aaa gca aca caa gca agt cag aca gte act tta tct gga 

Gly Thr Leu Lys Ala Thr Gin Ala Ser Gin Thr Val Thr Leu Ser Gly 
505 510 515 

teg etc tct ctt gta gat cct tet gga aat gte tac gaa gat gte tct 

Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val Tyr Glu Asp Val Ser 
520 525 530 



1651 



1699 



tgg aat aac cct caa gte ttt tet tgt ete act ctt act get gae gac 
Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr Leu Thr Ala Asp Asp 
535 540 545 



1747 



ecc gcg aat att cac ate aca gae tta get get gat ccc eta gaa aaa 
Pro Ala Asn lie His lie Thr Asp Leu Ala Ala Asp Pro Leu Glu Lys 
550 555 560 565 



1795 



aat cct ate cat tgg gga tac caa ggg aat tgg gca tta tct tgg caa 
Asn Pro lie His Trp Gly Tyr Gin Gly Asn Trp Ala Leu Ser Trp Gin 
570 575 580 



1843 



gag gat act gcg act aaa tee aaa gca gcg act ctt ace tgg aca aaa 
Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr Leu Thr Trp Thr Lys 
585 590 595 



1891 



aca gga tac aat ccg aat cct gag egt egt gga acc tta gtt get aac 
Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly Thr Leu Val Ala Asn 
600 605 610 



1939 



acg eta tgg gga tec ttt gtt gat gtg cge tec ata caa cag ctt gta 
Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser lie Gin Gin Leu Val 
615 620 625 



1987 



gee act aaa gta cge caa tct caa gaa act cge ggc ate tgg tgt gaa 
Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg Gly lie Trp Cys Glu 
630 635 640 645 



2035 
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999 s^tc teg aac ttc ttc cat aaa gat age acg aag ata aat aaa ggt 
Gly lie Ser Asn Phe Phe His Lys Asp Ser Thr Lys lie Asn Lys Gly 
650 655 660 



2083 



ttt cgc cac ata agt gca ggt tat gtt gta gga gcg act aca aca tta 
Phe Arg His lie Ser Ala Gly Tyr Val Val Gly Ala Thr Thr Thr Leu 
665 670 675 



2131 



get tct gat aat ctt ate act gca gee ttc tgc caa tta ttc ggg aaa 
Ala Ser Asp Asn Leu He Thr Ala Ala Phe Cys Gin Leu Phe Gly Lys 
680 685 690 



2179 



gat aga gat cac ttt ata aat aaa aat aga get tet gee tat gca get 
Asp Arg Asp His Phe He Asn Lys Asn Arg Ala Ser Ala Tyr Ala Ala 
695 700 705 



2227 



tct etc cat etc cag cat eta gcg ace ttg tct tct cca age ttg tta 
Ser Leu His Leu Gin His Leu Ala Thr Leu Ser Ser Pro Ser Leu Leu 
710 715 720 725 



2275 



ege tac ctt cet gga tct gaa agt gag cag ect gtc etc ttt gat get 
Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro Val Leu Phe Asp Ala 
730 735 740 



2323 



cag ate age tat ate tat agt aaa aat act atg aaa ace tat tac ace 
Gin He Ser Tyr He Tyr Ser Lys Asn Thr Met Lys Thr Tyr Tyr Thr 
745 750 755 



2371 



caa gca cca aag gga gag age teg tgg tat aat gae ggt tgc get etg 
Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn Asp Gly Cys Ala Leu 
760 765 770 



2419 



gaa ctt gcg age tee eta cca cac act get tta age eat gag ggt etc 
Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu Ser His Glu Gly Leu 
775 780 785 



2467 



ttc eae gcg tat ttt cet ttc ate aaa gta gaa get teg tac ata eae 
Phe His Ala Tyr Phe Pro Phe He Lys Val Glu Ala Ser Tyr He His 
790 795 800 805 



2515 



caa gat age ttc aaa gaa cgt aat act ace ttg gta ega tet ttc gat 
Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu Val Arg Ser Phe Asp 
810 815 820 



2563 



^9c ggt gat tta att aae gte tct gtg cet att gga att ace ttc gag 
Ser Gly Asp Leu He Asn Val Ser Val Pro He Gly He Thr Phe Glu 
825 830 835 



2611 



aga ttc teg aga aac gag cgt gcg tct tac gaa get act gte ate tac 
Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu Ala Thr Val He Tyr 
840 845 850 



2659 



gtt gee gat gte tat cgt aag aat ect gae tgc acg aca get etc eta 
Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys Thr Thr Ala Leu Leu 
855 860 865 



2707 



MAP 0 8 20(K 2} 
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ate 


aac 


aat 


acc 


teg 


tgg 


aaa 


act 


aca 


99a 


acg 


aat 


etc 


tea 


aga 


eaa 


2755 


lie 


Asn 


Asn 


Thr 


Ser 


Trp 


Lys 


Thr 


Thr 


Gly 


Thr 


Asn 


Leu 


Ser 


Arg 


Gin 




870 










875 










880 










885 




get 


ggt 


ate 


gga 


aga 


gea 


999 


ate 


ttt 


tat 


gee 


ttc 


tet 


eea 


aat 


ett 


2803 


Ala 


Gly 


He 


Gly 


Arg 


Ala 


Gly 


He 


Phe 


Tyr 


Ala 


Phe 


Ser 


Pro 


Asn 


Leu 












890 










895 










900 






gag 


gtc 


aca 


agt 


aac 


eta 


tct 


atg 


gaa 


att 


cgt 


gga 


tet 


tea 


cge 


age 


2851 


Glu 


Val 


Thr 


Ser 


Asn 


Leu 


Ser 


Met 


Glu 


He 


Arg 


Gly 


Ser 


Ser 


Arg 


Ser 










905 










910 










915 








tac 


aat 


gca 


gat 


ett 


gga 


ggt 


aag 


ttc 


cag 


ttc 


taaaagcgtt cctgatceet 


2904 


Tyr 


Asn 


Ala 
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Gly 


Gly 
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Phe 


Gin 


Phe 















920 



925 



tagaaattct aagagatcct gagtgtatct agggacttet eaaaga 



2950 
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Figure 6 Ar: Restriction enzyme analysis of CPNl 00638 

Tsp509I 

Hinfl Bfal Msel | 

Tfil Msel Spell BsmAI Mnll || CviRI 

I I II I I II I 

TGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGCAGTA 

1 + + + + + + 60 

ACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTCAT 

MboII 

Msel CviJI Mnll Mnll 

Tsp509I HphI I Sfcl I Nlallll Earl | 

III II II II 
TGTGGTGAAATAATTTGTTAAACCACCTATAGCCCTCTACATGAAATCCTCTCTTCATTG 
61 + + + + + + 120 

ACACCACTTTATTAAACAATTTGGTGGATATCGGGAGATGTACTTTAGGAGAGAAGTAAC 



Bbvl 
Apol 
Tsp509I 



Msel 



Maelll 
Tsp45I 



Fnu4HI 
Mwol I 
Tsel I 



GTTTTTAATCTCGTCATCTTTAGCACTTCCCTTGTCACTAAATTTCTCTGCGTTTGCTGC 

121 + + + + + + 180 

CAAAAATTAGAGCAGTAGAAATCGTGAAGGGAACAGTGATTTAAAGAGACGCAAACGACG 



NlalV 
BseRI 
Bpml 
ScrFI 
EcoRII 

Avail Avail 
ECOO109I BspGI 
PspSII Sau96I 
Sau96I Hpyl78IIl| | | ||BsaXI 

Sse8647I Alul II I I I I Aid I 

Bfal I CviJI I I I I II Ppil 1 

II I II I I II II 

TGTTGTTGAAATCAATCTAGGACCTACCAATAGCTTCTCTGGACCAGGAACCTACACTCC 

181 + + + + + + 240 

ACAACAACTTTAGTTAGATCCTGGATGGTTATCGAAGAGACCTGGTCCTTGGATGTGAGG 



Bed 

Mnll Tthllllll 
CviJI I CviRI I I HphI BsmAI 

II I II II 

TCCAGCCCAAACAACAAATGCAGATGGAACTATCTATAATCTAACAGGGGATGTCTCAAT 

241 + + + + + + 300 

AGGTCGGGTTTGTTGTTTACGTCTACCTTGATAGATATTAGATTGTCCCCTACAGAGTTA 



Fig. 6 B>{ j3o n't) 
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Alul 
CviJI 
Alwl 
HpylSSIX 
Dpnl 



BstYI 
Sau3AI 
Fokl CviRI I 
I I I 



Bsri 
Hinf I 
Tf il 
Bmrl I 



301 



Acelll Dral 
Acil I Msel| 
j j j j Mwol I I Mmel I I 

II I III II 
CACCAATGCAGGATCTCCGACAGCTCTAACCGCTTCCTGCTTTAAAGAAACTACTGGGAA 
+ + + + + + 350 

GTGGTTACGTCCTAGAGGCTGTCGAGATTGGCGAAGGACGAAATTTCTTTGATGACCCTT 



BsaJI 

BstDSI Clal 

CviJI I TaqI 

Hael I Faul 

Haelllj Sthl32l| 

BsaJI I I Tsp509I SfaNI | | 
Styl I I CviJI I Beef I | | | | Acil 

I II I I I I II I I 

TCTTTCTTTCCAAGGCCACGGCTACCAATTTCTCCTACAAAATATCGATGCGGGAGCGAA 

361 + + + + + + 420 

AGAAAGAAAGGTTCCGGTGCCGATGGTTAAAGAGGATGTTTTATAGCTACGCCCTCGCTT 



Bbvl 
Rsal I 
Taal I I 

I I I 



Alul 
CviJI 
Hindlll 
CviRI 
Fnu4HI 

Alul 
CviJI 
MspAlI 
PvuII 
Tsel 



Hinf I 
Tf il 
Hpyl78III I 



Maelll 
Tsp45I 
I 



CTGTACCTTTACCAATACAGCTGCAAATAAGCTTCTCTCCTTTTCAGGATTCTCCTATTT 

421 + + + + + + 

GACATGGAAATGGTTATGTCGACGTTTATTCGAAGAGAGGAAAAGTCCTAAGAGGATAAA 



480 



481 



Bed 

CviJI I Alul 
BsmI NlaIV| I CviJI 

I III I 
GTCACTAATACAAACCACGAATGCTACCACAGGAACAGGAGCCATCAAGTCCACAGGAGC 
+ + ^ + + + 

CAGTGATTATGTTTGGTGCTTACGATGGTGTCCTTGTCCTCGGTAGTTCAGGTGTCCTCG 



540 
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CviJI 

Hael 
Haelll 

MscI 
Eael I 
Mwol I I 

Taqi Sfcl CjePI | j | Mnll 

II I III I 

TTGTTCTATTCAGTCGAACTATAGTTGCTACTTTGGCCAAAACTTTTCTAATGACAATGG 

AACAAGATAAGTCAGCTTGATATCAACGATGAAACCGGTTTTGAAAAGATTACTGTTACC 



600 



BsaJI 
Hael I 
Hhal 



Hin4I 
NlalV 
BsaHI I 
Narl I 
BanI I j 
CjePI I I 
III 



Alul 
CviJI 
Mnll I 
Fnu4HI I I 
Styl Tsel | | j 
I Mil 



Acelll 
Bbvl I 
BplI I I 
I I I 



CjePI 
I 



Acll 
Mae 1 1 



AGGCGCCCTCCAAGGCAGCTCTATCAGTCTATCGCTAAACCCCAACCTAACGTTTGCCAA 

601 + + + + + + 660 

TCCGCGGGAGGTTCCGTCGAGATAGTCAGATAGCGATTTGGGGTTGGATTGCAAACGGTT 



Sthl32I 
Bspl286I 
Bmgl 
BseSI 
NlaIV| 
BanI I I 
I II 



BslI 
BscGI 
Mnll 
BsaJI I 
BstDSI I 
Mnll I I 

II I 



Msel 
Tsp509I I 
I I 



AAACAAAGCAACGCAAAAAGGGGGTGCCCTCTATTCCACGGGAGGGATTACAATTAACAA 

661 + + + + + + 720 

TTTGTTTCGTTGCGTTTTTCCCCCACGGGAGATAAGGTGCCCTCCCTAATGTTAATTGTT 



Fnu4HI 
SacII 
Taul 
Acil I 
MspAlI I 

Thai I CviJI 
BseMII Acil I I Nlaiv| 

Hpyl88IX BsaJI | | Ecil | | 

SfaNl| BstDSI II Acilj II 

II I II II II 

TACGTTAAACTCAGCATCATTTTCTGAAAATACCGCGGCGAACAATGGCGGAGCCATTTA 
+ + + + + + 

ATGCAATTTGAGTCGTAGTAAAAGACTTTTATGGCGCCGCTTGTTACCGCCTCGGTAAAT 



Msel 
Mae I I I Ddel 
I I I 



721 



Fig. 6 0 (Gon^t) 
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Cac8I 
Bf al 
Alul I 

CviJljl Fnu4HI Bbvl Alul Maelll 

Nheljj Tsel| Tsp509I CviJI Tsp45I 

III II I I I 

CACGGAAGCTAGCAGTTTTATTAGCAGCAACAAAGCAATTAGCTTTATAAACAATAGTGT 

781 + + + + + + 840 

GTGCCTTCGATCGTCAAAATAATCGTCGTTGTTTCGTTAATCGAAATATTTGTTATCACA 



CviJI 
NlalV 
BseMII 
Mnll 
Cjel 



Sf cl 
Alul 
CviJI 
Mwol 
BbvCI I 
BpulOI I 
Acil Ddelj 
I II 



841 



Taal Rsal Cjel 
Sfcl| TatI I CviJI I BsrI 

II II II I 

GACCGCAACCTCAGCTACAGGGGGAGCCATTTACTGTAGTAGTACATCAGCCCCCAAACC 

+ + + + + + 

CTGGCGTTGGAGTCGATGTCCCCCTCGGTAAATGACATCATCATGTAGTCGGGGGTTTGG 



900 



Hpyl88IX Tsp509I Bfal 

Msel Sthl32I | BscGI Cjel | Spel | 

I I I I I I II 

AGTCTTAACTCTATCAGACAACGGGGAACTGAACTTTATAGGAAATACAGCAATTACTAG 

901 + + + + + + 960 

TCAGAATTGAGATAGTCTGTTGCCCCTTGACTTGAAATATCCTTTATGTCGTTAATGATC 



Cjel 



MboII 
Bfal I 
II 



Avail 
ECOO109I 
PspSII 
Sau96I 
Sse8647I 
Hpyl78III I 
Mnll I I 

I I I 



Bpml 
Dral I 
Msel I I 
II I 



TGGTGGGGCGATTTATACTGACAATCTAGTTCTTTCTTCTGGAGGACCTACGCTTTTTAA 

961 + + + + + + 1020 

ACCACCCCGCTAAATATGACTGTTAGATCAAGAAAGAAGACCTCCTGGATGCGAAAAATT 




Fig. 6&{ee«^ 
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Alul 
CviJI 
PstI 
BsaXI 
Fnu4HI 
CviRI I 
Tsel I 
Sfcl I I 
I II 



1021 



Bbvl I 
BslI I 
I 

AAACAACTCTGGCTATGATACTGCAGCTCCCTTAGGAGGAGCAATTGCGATTGCTGACTC 
+ + + + + + 1080 

TTTGTTGAGACCGATACTATGACGTCGAGGGAATCCTCCTCGTTAACGCTAACGACTGAG 



HaelV 
BseRI Hin4I 
Plel|HinfI I 

II II 



Dpnl 

BstYI I BsmAI 

Sau3AI I Alwl CviJI I 

Hpyl78IIl| I Hinfl Plel | | Taal 

II I I I I I I 

TGGATCTTTGAGTCTTTCGGCTCTTGGTGGAGACATCACTTTTGAAGGAAACACAGTAGT 

1081 + + + + + + 1140 

ACCTAGAAACTCAGAAAGCCGAGAACCACCTCTGTAGTGAAAACTTCCTTTGTGTCATCA 

Alul Hpyl88IX 
CviJI Hinfl | Apol Hpyl88IX 

MboII I Taqi I I Plel Tsp509I Msel | 

I I I I I I I I I 

CAAAGGAGCTTCTTCGAGTCAGACCACTACCAGAAATTCTATTAACATCGGAAACACCAA 

1141 + + + + + + 1200 

GTTTCCTCGAAGAAGCTCAGTCTGGTGATGGTCTTTAAGATAATTGTAGCCTTTGTGGTT 



CviJI 
CacSI 
Fnu4HI 

Alul 
CviJI 
MspAlI 
PvuII 
Bce83I I 

Rsal I I II Dpnl 
Bbvl BsrGI I I I I I Sau3AI | 

Ddel TatI |Tsel| j | Smll Mnll Alwl | | 

I I I II I I I I III 

TGCTAAGATTGTACAGCTGCGAGCCTCTCAAGGCAATACTATCTACTTCTATGATCCTAT 

1201 + + + + + + 1260 

ACGATTCTAACATGTCGACGCTCGGAGAGTTCCGTTATGATAGATGAAGATACTAGGATA 



■■o 



MAR" 
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Acelll 
Bbvl 
Hpyl8 8IX 
Alul 
CviJI 
PstI 



SfaNI 
TspRI 
Fnu4HI 
SfaNI 
CviRI 
Tsel 
BtsI I 
Sfcl I 

I I 



Hpyl78III 
Msel Avail I 
BseMII |Sau96I j 



tsLAi. £3j.(jj. I I I I JJaej. I I I ±3seiM±± |&a.uyD± | CviRI 

I I MM MM II II I 

AACAACTAGCATCACTGCAGCTCTCTCAGATGCTCTAAACTTAAATGGTCCTGACCTTGC 

1261 + + + + + + 13 

TTGTTGATCGTAGTGACGTCGAGAGAGTCTACGAGATTTGAATTTACCAGGACTGGAACG 

HpylBBIX 
Bbvl 
Alul 

Bed CviJI 
Hinfl Drdll | Hin4l| 

Tfil CviRI NlalVl j Hpyl78III Hin4I | j | | Bpml 

I I III I I II 

AGGGAATCCTGCATATCAAGGAACCATCGTATTTTCTGGAGAGAAGCTCTCGGAAGCAGA 

1321 + + + + + + 1380 

TCCCTTAGGACGTATAGTTCCTTGGTAGCATAAAAGACCTCTCTTCGAGAGCCTTCGTCT 



20 



Alul 
AlwNI 
CviJI 
Mwol 
PstI 
CviRI 
BplI 
Fnu4HI 
Sfcl 
Alul 
AlwNI 
BplI 
BstAPI 

CviJI I I I I I I Acil 
Mwol III II I Faul Mnll I 

Tseljjl II I Tsp509I Sthl32 I | Mnll | | 

MM III I II II I 

AGCTGCAGAAGCTGATAATCTCAAATCTACAATTCAGCAACCTCTAACTCTTGCGGGAGG 

1381 + + + + + + 

TCGACGTCTTCGACTATTAGAGTTTAGATGTTAAGTCGTTGGAGATTGAGAACGCCCTCC 



1440 



MAR 0 8 20M J 

IJ 



Fig. 6<S (con't) ' 
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Hin4I 



Maelll 
Tsp45I 
Hinf I 
Bplll 
Hpyl78III I I 
Msel I I I 

I III 



Bfal 

Plel Ddel Sthl32I 

I I I 

GCAACTCTCTCTTAAATCAGGAGTCACTCTAGTTGCTAAGTCCTTTTCGCAATCTCCGGG 

1441 + + + + + + 1500 

CGTTGAGAGAGAATTTAGTCCTCAGTGAGATCAACGATTCAGGAAAAGCGTTAGAGGCCC 



CviJI 
BseRI 
Neil I 
ScrFI I 
Mspl| I 
II I 



Sf aNI 
Banll I 
Bspl286I I 
I I 



1501 



Fokl 
Cjel 
NlalV 
Avail 



Sau96I 
CviRI 
Mnll 
Mnll I 
Nlallll I 
BslI I I I 
I I I I 



Alwl 
BsaBI 
Dpnl 

MspAlI Sau3AI | 
Acil I Bed I I 
BsmFI I |Bs1I| I j 

III II I I 

CTCTACCCTCCTCATGGATGCAGGGACCACATTAGAAACCGCTGATGGGATCACTATCAA 
+ + + + + + 1560 

GAGATGGGAGGAGTACCTACGTCCCTGGTGTAATCTTTGGCGACTACCCTAGTGATAGTT 



Bspl286I 

Bsal CacBI 
Hinfl BsmAI Bmgl | 

Cjel Tfil Msel BseSI | Bsbl 

I II II I 

TAATCTTGTTCTCAATGTAGATTCCTTAAAAGAGACCAAGAAGGGCACGCTAAAAGCAAC 

1561 + + + + + + 1620 

ATTAGAACAAGAGTTACATCTAAGGAATTTTCTCTGGTTCTTCCCGTGCGATTTTCGTTG 



Tthlllll 
Maelll 
Taal 
Tsp45I 
HpylSSIX I 
Mwol Cjel I j 
I III 



Hpyl78III 
Dpnl 
BstYI 



Sau3AI 
Cjel I 
Alwl I I 
Alwl I I I 

I III 

ACAAGCAAGTCAGACAGTCACTTTATCTGGATCGCTCTCTCTTGTAGATCCTTCTGGAAA 

1621 + + + + + + 1680 

TGTTCGTTCAGTCTGTCAGTGAAATAGACCTAGCGAGAGAGAACATCTAGGAAGACCTTT 



Dpnl 
SauSAI I 
Hpyl78IIl| I 



Pf 111081 
AccI I 
CjePlI I 
II I 



MboII 
BsmAI I 
Bce83I I I 

I I I 



CjePI 
Smll I Mnll 

I I I 



BsmAI 



TGTCTACGAAGATGTCTCTTGGAATAACCCTCAAGTCTTTTCTTGTCTCACTCTTACTGC 

1681 + + + + + + 1740 

ACAGATGCTTCTACAGAGAACCTTATTGGGAGTTCAGAAAAGAACAGAGTGAGAATGACG 



Fig. 6W"(G©faf 
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Faul 
Sspl 
Sthl32I I 
Thai I I 
SimI Acil I I I 
I I I II 



Dpnl 
Sau3AI 
Fnu4HI 
Alul 



1741 



Alwl I 
Cvi JI I 
Tsel I 

Bbvl Ddel II II Bfal 

I I 

TGACGACCCCGCGAATATTCACATCACAGACTTAGCTGCTGATCCCCTAGAAAAAAATCC 
+ + + + + + 1800 

ACTGCTGGGGCGCTTATAAGTGTAGTGTCTGAATCGACGACTAGGGGATCTTTTTTTAGG 



Tsp509I 

BsaJI I BciVI 

BciVI Styl I Mnll | 

I II II 

TATCCATTGGGGATACCAAGGGAATTGGGCATTATCTTGGCAAGAGGATACTGCGACTAA 

1801 + + + + + + 1860 

ATAGGTAACCCCTATGGTTCCCTTAACCCGTAATAGAACCGTTCTCCTATGACGCTGATT 



BpulOI 
Ddel 
Hpyl78III 
Hinf I 

Hinfl BspGI Tfil 

Fnu4HI I ScrFI | Hgal | 

Tsel I I Bbvl I I HpylSSIX | 

Plel I I I EcoRII I I BciVI BseMII | | 

III I II I I III 

ATCCAAAGCAGCGACTCTTACCTGGACAAAAACAGGATACAATCCGAATCCTGAGCGTCG 

1861 + + + + + + 

TAGGTTTCGTCGCTGAGAATGGACCTGTTTTTGTCCTATGTTAGGCTTAGGACTCGCAGC 



1920 



Cjel 
Ddel I 
NlalV I I 
I II 



Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Alwl I 



Alwl 



Hhal 
Hin4I I 
Cjel I I 
I I I 



TGGAACCTTAGTTGCTAACACGCTATGGGGATCCTTTGTTGATGTGCGCTCCATACAACA 

1921 + + + + + + 1980 

ACCTTGGAATCAACGATTGTGCGATACCCCTAGGAAACAACTACACGCGAGGTATGTTGT 



Alul Bce83I Hpyl78III 

CviJI CviJI I Rsal Smll | 

I III II 

GCTTGTAGCCACTAAAGTACGCCAATCTCAAGAAACTCGCGGCATCTGGTGTGAAGGGAT 

1981 + + + + + + 2040 

CGAACATCGGTGATTTCATGCGGTTAGAGTTCTTTGAGCGCCGTAGACCACACTTCCCTA 



Fnu4HI 
Taul 
Acil I 
Thai I 

I I 



Dpnl 
BstYI I 
Sau3AI I 
SfaNI I I 

I I I 



m 

V 



0 zi0> 



Ui 



Aim 
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XmnI 
Alwl 
Hpyl78III I 
TaqI i 
MboII I I 
I 



MslI 
BspMI ICviRI 



MboII 

,, . , I I I I 

CTCGAACTTCTTCCATAAAGATAGCACGAAGATAAATAAAGGTTTTCGCCACATAAGTGC 

2041 + + + + + + 2100 

GAGCTTGAAGAAGGTATTTCTATCGTGCTTCTATTTATTTCCAAAAGCGGTGTATTCACG 

CviJI 
PstI 
TspRI 
Fnu4HI 
CviRI 

Hpyl88IX Tsel 
Alul I Btsi I 

Bsgl CviJI j Sfcl j 

I II II 

AGGTTATGTTGTAGGAGCGACTACAACATTAGCTTCTGATAATCTTATCACTGCAGCCTT 

2101 + + + + + + 2160. 

TCCAATACAACATCCTCGCTGATGTTGTAATCGAAGACTATTAGAATAGTGACGTCGGAA 



Bbvl 
Tsp509I 
Sthl32I |Hpyl78III 
I I I 



Dpnl 
Sau3AI I 
BsaBlj I 

II I 



Alul 
CviJI 

I 



CTGCCAATTATTCGGGAAAGATAGAGATCACTTTATAAATAAAAATAGAGCTTCTGCCTA 

2161 + + + + + + 2220 

GACGGTTAATAAGCCCTTTCTATCTCTAGTGAAATATTTATTTTTATCTCGAAGACGGAT 



Alul 
CviJI 
Fnu4HI 
Bpml I 
CviRI I 
Mwol I 
Tsel I 



Bbvl 
Bed 



Tthllll 
SfaNI 
Bbsl I 
MboII I 
Bfal I I 



Mwol 
Maelll 
Alul I 
CviJI I 
Hindi I I I I 
I I I 

TGCAGCTTCTCTCCATCTCCAGCATCTAGCGACCTTGTCTTCTCCAAGCTTGTTACGCTA 

2221 + + + + + + 2280 

ACGTCGAAGAGAGGTAGAGGTCGTAGATCGCTGGAACAGAAGAGGTTCGAACAATGCGAT 



Hpyl8 8IX 
Dpnl 
BstYI 
Sau3AI 
ScrFI I 
EcoRII I I 
I I I 



SfaNI 
CviJI I 
Fnu4HI I I 
Alwl Tsel I 



Bbvl 



Sfcl 
BseMII 
Alul 
CviJI 
Dpnl 



SauBAI 
Hpyl8 8IX I 
Ddel I I 
Mnll I I I 



CCTTCCTGGATCTGAAAGTGAGCAGCCTGTCCTCTTTGATGCTCAGATCAGCTATATCTA 

2281 + 4- + + + + 2340 

GGAAGGACCTAGACTTTCACTCGTCGGACAGGAGAAACTACGAGTCTAGTCGATATAGAT 
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Banll 
BsiHKAI 
Bspl286I 
Sad 
BssSI I 
Alul I 
Cvi JI I 
Tthlllll I 
CjePI Taqll | j 

I III 

TAGTAAAAATACTATGAAAACCTATTACACCCAAGCACCAAAGGGAGAGAGCTCGTGGTA 

2341 + + + + ' + + 2400 

ATCATTTTTATGATACTTTTGGATAATGTGGGTTCGTGGTTTCCCTCTCTCGAGCACCAT 



Banll Nlalll 

BsiHKAI MboII 

Bspl286I CviJI 

Sad Mnll 

Alul I Msel I 

CviJI I TspRI I 1 
CacSI I I BtsI I I j 

I I I I II I 

TAATGACGGTTGCGCTCTGGAACTTGCGAGCTCCCTACCACACACTGCTTTAAGCCATGA 

2401 + + + + + + 2460 

ATTACTGCCAACGCGAGACCTTGAACGCTCGAGGGATGGTGTGTGACGAAATTCGGTACT 



Hpyl78III 
Hhal I 
CjePI I I 
Taal I I I 
I I I I 



Thai 
Af IIII 
Mlul 

Earl I I Alul 
Bsal I I I CviJI 
SimI BsmAI I I I Hindlll | Rsal 

I I II I III 

GGGTCTCTTCCACGCGTATTTTCCTTTCATCAAAGTAGAAGCTTCGTACATACACCAAGA 

2461 + + + + + + 2520 

CCCAGAGAAGGTGCGCATAAAAGGAAAGTAGTTTCATCTTCGAAGCATGTATGTGGTTCT 



Alul 
CviJI 



Mae 1 1 



Dpnl 

BsaJI Sau3AI | 
Styl Rsal I I Taqi Acil 
I IIII I 



Mae II 
HphI 
Msel 
Pad 
Tsp509I I 
Msel I I 
II I 



TAGCTTCAAAGAACGTAATACTACCTTGGTACGATCTTTCGATAGCGGTGATTTAATTAA 

2521 + + + + + + 2580 

ATCGAAGTTTCTTGCATTATGATGGAACCATGCTAGAAAGCTATCGCCACTAAATTAATT 



0 6?D0^ 



Fig. 6><- (con't) 
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Hgal 
Hpyl78III 
Hpyl78III 
Taqi 
Aval I 
Smllj 
Hin4I II.. 
Hinfl I I 1 j BplI 
BsmAI Hpyl78III |xhol| | j Cac8l| 

BsmBI Tsp509I TaqI |TfiI || I I Hin4I 

I I 1 I I II , . 

CGTCTCTGTGCCTATTGGAATTACCTTCGAGAGATTCTCGAGAAACGAGCGTGCGTCTTA 

2581 + + + + + + 2640 

GCAGAGACACGGATAACCTTAATGGAAGCTCTCTAAGAGCTCTTTGCTCGCACGCAGAAT 

Hpyl78III 

Alul Bsgl Hinfl | Alul 

CviJI Taal Maell BsaBI | Tfil | CviRI CviJI 

II I I I I I I I 

CGAAGCTACTGTCATCTACGTTGCCGATGTCTATCGTAAGAATCCTGACTGCACGACAGC 

2641 + + + + + + 2700 

GCTTCGATGACAGTAGATGCAACGGCTACAGATAGCATTCTTAGGACTGACGTGCTGTCG 

Bce83I Hpyl78III 
Sfcl| Hinfl I Alul 

Acelll BssSI Mnll I I Tfil Smll | CviJI 

I I III I I I I 

TCTCCTAATCAACAATACCTCGTGGAAAACTACAGGAACGAATCTCTCAAGACAAGCTGG 

2701 + + + + + + 2760 

AGAGGATTAGTTGTTATGGAGCACCTTTTGATGTCCTTGCTTAGAGAGTTCTGTTCGACC 



MboII Maelll 

Dpnl I Tsp4 5I 

Hpyl88IX BstYI | | Hpyl78III | 

Earl I Sau3AI | | Smll | j 

Sapl I Hin4I | j j Alwl Mnll | | | Maelll 

II II II I I II I I 

TATCGGAAGAGCAGGGATCTTTTATGCCTTCTCTCCAAATCTTGAGGTCACAAGTAACCT 

2761 + + + + + + 2820 

ATAGCCTTCTCGTCCCTAGAAAATACGGAAGAGAGGTTTAGAACTCCAGTGTTCATTGGA 



Dpnl 
BstYI 
Sau3AI 
MboII 
Apol 



Tsp509I 
BslI I 
Bce83I I I 

I I I 



Alul 
CviJI 
Fnu4HI I 
Tsel| I 
Alwl I I j 

I II I 



Dpnl 
Mnll 
Bglll 
BstYI 
Sau3AI 
Bbvl I 
CviRI I I 

I I I 



Bsri 



ATCTATGGAAATTCGTGGATCTTCACGCAGCTACAATGCAGATCTTGGAGGTAAGTTCCA 

2821 + + + + + + 2880 

TAGATACCTTTAAGCACCTAGAAGTGCGTCGATGTTACGTCTAGAACCTCCATTCAAGGT 
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Ddel 
Hpyl78III 
Dpnl 
BstYI 
Sau3AI 
Dpnl BseMII 
Sau3AI I Apol I 

Hpyl78IIl| I Tsp509I Alwl | 
Alwl II I Ddel I Ddel I | | | Bfal 

I II I I I II II I I 

GTTCTAAAAGCGTTCCTGATCCCTTAGAAATTCTAAGAGATCCTGAGTGTATCTAGGGAC 

2881 + + + + + + 2940 

CAAGATTTTCGCAAGGACTAGGGAATCTTTAAGATTCTCTAGGACTCACATAGATCCCTG 



BsmFI 



TTCTCAAAGA 

2941 + 2950 

AAGAGTTTCT 



20Q4 
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^AO^?^qp^gure 7 A: Nucleotide and amino acid sequence of CPN100639 



gtacgaagtt cttcacgaaa ttataataca aacctaggct ctaagttttg tttctagatt 60 

atcgaaaacg tgttaattaa ttgaacccaa gcatctttct atg aaa ata ccc ttg 115 

Met Lys lie Pro Leu 
1 5 



cac aaa etc ctg ate tet teg act ctt gte act cee att eta ttg age 
His Lys Leu Leu lie Ser Ser Thr Leu Val Thr Pro lie Leu Leu Ser 
10 15 20 



163 



att gca act tac gga gca gat get tet tta tee ect aca gat age ttt 
lie Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser Pro Thr Asp Ser Phe 
25 30 35 



211 



gat gga gcg ggc ggc tet aca ttt act eca aaa tet aca gca gat gee 
Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys Ser Thr Ala Asp Ala 
40 45 50 



259 



aat gga acg aac tat gtc tta tea gga aat gte tat ata aac gat get 
Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val Tyr lie Asn Asp Ala 
55 60 65 



307 



ggg aaa ggc aca gca tta aca ggc tgc tgc ttt aca gaa act acg ggt 
Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe Thr Glu Thr Thr Gly 
70 75 80 85 



355 



gat ctg aca ttt act gga aag gga tac tea ttt tea ttc aac acg gta 
Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe Ser Phe Asn Thr Val 
90 95 100 



403 



gat gcg ggt teg aat gca gga get gcg gca age aca act get gat aaa 
Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser Thr Thr Ala Asp Lys 
105 110 115 



451 



gee eta ate ttc aca gga ttt tet aac ctt tec ttc att gca get ect 
Ala Leu lie Phe Thr Gly Phe Ser Asn Leu Ser Phe lie Ala Ala Pro 
120 125 130 



499 



gga act aca gtt get tea gga aaa agt act tta agt tet gca gga gee 
Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu Ser Ser Ala Gly Ala 
135 140 145 



547 



tta aat ctt ace gat aat gga acg att etc ttt age caa aac gtc tee 
Leu Asn Leu Thr Asp Asn Gly Thr lie Leu Phe Ser Gin Asn Val Ser 
150 155 160 165 



595 



aat gaa get aat aac aat ggc gga gcg ate ace aca aaa act ctt tet 
Asn Glu Ala Asn Asn Asn Gly Gly Ala lie Thr Thr Lys Thr Leu Ser 
170 175 180 



643 



att tet ggg aat ace tet tet ata ace ttc act agt aat age gca aaa 
lie Ser Gly Asn Thr Ser Ser lie Thr Phe Thr Ser Asn Ser Ala Lys 
185 190 195 



691 
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aaa tta ggt gga gcg ate tat age tct gcg get gca agt att tea gga 
Lys Leu Gly Gly Ala lie Tyr Ser Ser Ala Ala Ala Ser lie Ser Gly 
200 205 210 



739 



aac ace gge eag tta gte ttt atg aat aat aaa gga gaa act ggg ggt 787 
Asn Thr Gly Gin Leu Val Phe Met Asn Asn Lys Gly Glu Thr Gly Gly 
215 220 225 

999 9ct ctg ggc ttt gaa gee age tec teg att aet caa aat age tee 835 
Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser lie Thr Gin Asn Ser Ser 
230 235 240 245 

ett tte ttc tct gga aae aet gca aca gat get gca ggc aag ggc ggg 883 
Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala Ala Gly Lys Gly Gly 
250 255 260 

gee att tat tgt gaa aaa aca gga gag aet eet aet ett aet ate tct 931 
Ala lie Tyr Cys Glu Lys Thr Gly Glu Thr Pro Thr Leu Thr lie Ser 
265 270 275 

gga aat aaa agt etg ace tte gee gag aae tet tea gta aet eaa ggc 979 
Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser Ser Val Thr Gin Gly 
280 285 290 

gga gca ate tgt gee eat ggt eta gat ett tee get get ggc cct ace 1027 
Gly Ala lie Cys Ala His Gly Leu Asp Leu Ser Ala Ala Gly Pro Thr 
295 300 305 

eta ttt tea aat aat aga tgc ggg aac aca get gca gge aag gge gge 1075 
Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala Ala Gly Lys Gly Gly 
310 315 320 325 

get att gca att gee gac tct gga tct tta agt etc tct gea aat eaa 1123 
Ala lie Ala lie Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Asn Gin 
330 335 340 

gga gae ate aeg ttc ett gge aae aet eta aee tea ace tec gcg cea 1171 
Gly Asp lie Thr Phe Leu Gly Asn Thr Leu Thr Ser Thr Ser Ala Pro 
345 350 355 

aca teg aca egg aat get ate tae etg gga teg tea gea aaa att acg 1219 
Thr Ser Thr Arg Asn Ala lie Tyr Leu Gly Ser Ser Ala Lys lie Thr 
360 365 370 

aae tta agg gca gee caa ggc caa tct ate tat tte tat gat eeg att 1267 
Asn Leu Arg Ala Ala Gin Gly Gin Ser lie Tyr Phe Tyr Asp Pro lie 
375 380 385 

gca tct aac ace aca gga get tea gac gtt ctg ace ate aac caa eeg 1315 
Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu Thr lie Asn Gin Pro 
390 395 400 405 



gat age aac teg cct tta gat tat tea gga aeg att gta ttt tct ggg 
Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr lie Val Phe Ser Gly 
410 415 420 



1363 
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gaa aag etc tct gca gat gaa gcg aaa get get gat aac ttc aca tct 1411 
Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala Asp Asn Phe Thr Ser 
425 430 435 
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ata tta aag caa cca ttg get eta gee tct gga acc tta gca etc aaa 1459 
lie Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly Thr Leu Ala Leu Lys 
440 445 450 

gga aat gtc gag tta gat gtc aat ggt ttc aca cag act gaa gge tct 1507 
Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr Gin Thr Glu Gly Ser 
455 460 465 

aca etc etc atg caa cca gga aca aag etc aaa gca gat act gaa get 1555 
Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys Ala Asp Thr Glu Ala 
470 475 480 485 

ate agt ett acc aaa ctt gtc gtt gat ctt tet gee tta gag gga aat 1603 
lie Ser Leu Thr Lys Leu Val Val Asp Leu Ser Ala Leu Glu Gly Asn 
490 495 500 

aag agt gtg tec att gaa aca gca gga gee aac aaa act ata act eta 1651 
Lys Ser Val Ser lie Glu Thr Ala Gly Ala Asn Lys Thr lie Thr Leu 
505 510 515 

acc tct cct ctt gtt ttc caa gat agt age ggc aat ttt tat gaa age 1699 
Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly Asn Phe Tyr Glu Ser 
520 525 530 

cat acg ata aac caa gee ttc acg cag cct ttg gtg gta ttc act get 1747 
His Thr lie Asn Gin Ala Phe Thr Gin Pro Leu Val Val Phe Thr Ala 
535 540 545 

get act get get age gat att tat ate gat gcg ctt etc act tct cca 1795 
Ala Thr Ala Ala Ser Asp lie Tyr lie Asp Ala Leu Leu Thr Ser Pro 
550 555 560 565 

gta caa act eca gaa cct cat tac ggg tat cag gga cat tgg gaa gee 1843 
Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin Gly His Trp Glu Ala 
570 575 580 

act tgg gca gac aca tea act gca aaa tea gga act atg act tgg gta 1891 
Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly Thr Met Thr Trp Val 
585 590 595 

act acg ggc tac aac cct aat cct gag cgt aga get tec gta gtt cec 1939 
Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Ala Ser Val Val Pro 
600 605 610 

gat tea tta tgg gca tec ttt act gac att egc act eta cag cag ate 1987 
Asp Ser Leu Trp Ala Ser Phe Thr Asp lie Arg Thr Leu Gin Gin lie 
615 620 625 



atg aca tct caa gcg aat agt ate tat cag caa cga gga etc tgg gca 
Met Thr Ser Gin Ala Asn Ser lie Tyr Gin Gin Arg Gly Leu Trp Ala 
630 635 640 645 



2035 



4 
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tea gga act gcg aat ttc ttc cat aag gat aaa tea gga act aac caa 
Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys Ser Gly Thr Asn Gin 
650 655 660 



2083 



gca ttc cga cat aaa age tac gge tat att gtt gga gga agt get gaa 
Ala Phe Arg His Lys Ser Tyr Gly Tyr lie Val Gly Gly Ser Ala Glu 
665 670 675 



2131 



gat ttt tet gaa aat ate ttc agt gta get ttc tgc cag etc ttc ggt 
Asp Phe Ser Glu Asn lie Phe Ser Val Ala Phe Cys Gin Leu Phe Gly 
680 685 690 



2179 



aaa gat aaa gae ctg ttt ata gtt gaa aat ace tet eat aac tat tta 
Lys Asp Lys Asp Leu Phe lie Val Glu Asn Thr Ser His Asn Tyr Leu 
695 700 705 



2227 



gcg teg eta tae ctg caa eat cga gca ttc eta gga gga ett ccc atg 
Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu Gly Gly Leu Pro Met 
710 715 720 725 



2275 



ccc tea ttt gga agt ate ace gae atg ctg aaa gat att ect etc att 
Pro Ser Phe Gly Ser lie Thr Asp Met Leu Lys Asp lie Pro Leu lie 
730 735 740 



2323 



ttg aat gcc cag eta age tac age tae act aaa aat gat atg gat act 
Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys Asn Asp Met Asp Thr 
745 750 755 



2371 



cgc tat act tec tat ect gaa get caa gge tet tgg acc aat aac tet 
Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser Trp Thr Asn Asn Ser 
760 765 770 



2419 



ggg get eta gag etc gga gga tet ctg get eta tat etc ect aaa gaa 
Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu Tyr Leu Pro Lys Glu 
775 780 785 



2467 



gea ccg ttc ttc cag gga tat ttc ccc ttc tta aag ttc cag gca gtc 
Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu Lys Phe Gin Ala Val 
790 795 800 805 



2515 



tac age ege caa caa aac ttt aaa gag agt gge get gaa gcc cgt get 
Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly Ala Glu Ala Arg Ala 
810 815 820 



2563 



ttt gat gat gga gae eta gtg aac tgc tet ate ect gtc gge att egg 
Phe Asp Asp Gly Asp Leu Val Asn Cys Ser lie Pro Val Gly lie Arg 
825 830 835 



2611 



tta gaa aaa ate tee gaa gat gaa aaa aat aat ttc gag att tet eta 
Leu Glu Lys lie Ser Glu Asp Glu Lys Asn Asn Phe Glu lie Ser Leu 
840 845 850 



2659 



gee tac att ggt gat gtg tat cgt aaa aat ccc cgt teg cgt act tet 
Ala Tyr lie Gly Asp Val Tyr Arg Lys Asn Pro Arg Ser Arg Thr Ser 
855 860 865 



2707 
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eta atg gtc agt gga gcc tct tgg act teg eta tgt aaa aac etc gca 2755 

Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu Cys Lys Asn Leu Ala 
870 875 880 885 

cga caa gcc ttc tta gca agt get gga age cat ctg act etc tec cet 2803 

Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His Leu Thr Leu Ser Pro 
890 895 900 

eat gta gaa etc tct ggg gaa get get tat gag ctt cgt ggc tea gca 2851 

His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu Leu Arg Gly Ser Ala 
905 910 915 

cac ate tac aat gta gat tgt ggg eta aga tae tea ttc tagttectae 2900 

His lie Tyr Asn Val Asp Cys Gly Leu Arg Tyr Ser Phe 
920 925 930 

tttcctccet aaacttttag ggaggaatte ttataaaaac cetgtagatt cttaacttae 2 960 



tagtetetee tttcetcttg ctttctttaa tttattgeag 



3000 



MAR 0 8 2004 
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Figure 8 Restriction enzyme analysis of CPNl 00639 



Ddel 
CviJI 
Bfal I 
Avrlll I I Hpyl78III 
BsaJI I I I Bfal I 

Styl I I I Xbal I I 

I I I I II I I III 

GTACGAAGTTCTTCACGAAATTATAATACAAACCTAGGCTCTAAGTTTTGTTTCTAGATT 

1 + + + + + ^ gQ 

CATGCTTCAAGAAGTGCTTTAATATTATGTTTGGATCCGAGATTCAAAACAAAGATCTAA 



Mboll 
Rsal 



Tsp509I 
Hpyl78III I 
XmnI I 



Tsp509I 
Msel 
Pad 
Vspl 
Tsp509I I 

Afllll I I I Tthlllll 

Taqi Maell Msel | jj SfaNI | CviRI 

I I II II II I 

ATCGAAAACGTGTTAATTAATTGAACCCAAGCATCTTTCTATGAAAATACCCTTGCACAA 

6X + + + + + + 120 

TAGCTTTTGCACAATTAATTAACTTGGGTTCGTAGAAAGATACTTTTATGGGAACGTGTT 



Plel 
Dpnl 
Sau3AI [ 
Hpyl78IIl| I 
MboII I I I 
I II I 



Maelll 
Tsp45I 
PshAI 
Earl I 



Hinfl I I CviRI 
TaqI I I I BsrDI | SfaNI 

I I I I III 
ACTCCTGATCTCTTCGACTCTTGTCACTCCCATTCTATTGAGCATTGCAACTTACGGAGC 

121 + + + + + + 180 

TGAGGACTAGAGAAGCTGAGAACAGTGAGGGTAAGATAACTCGTAACGTTGAATGCCTCG 



CviJI 
Fnu4HI 
Taul 
Acil 
CacSI 
Acil 
BsrBI 
Bed 
Faul 
Sthl32I I 
Alul I I 

Sfd CviJI II I I I I I I BsaXI 
I III 

AGATGCTTCTTTATCCCCTACAGATAGCTTTGATGGAGCGGGCGGCTCTACATTTACTCC 

181 + + + + + + 240 

TCTACGAAGAAATAGGGGATGTCTATCGAAACTACCTCGCCCGCCGAGATGTAAATGAGG 
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Sf Cl 

SfaNI I Hpyl78III SfaNI 

II- II 
AAAATCTACAGCAGATGCCAATGGAACGAACTATGTCTTATCAGGAAATGTCTATATAAA 

241 + + + + + + 300 

TTTTAGATGTCGTCTACGGTTACCTTGCTTGATACAGAATAGTCCTTTACAGATATATTT 



301 



Fnu4HI 
Tsel 

Msel Fnu4HI | 
Bbvl I Cvi JI I I 
Bbvl I I Tsel I II Sthl32I 

III III 
CGATGCTGGGAAAGGCACAGCATTAACAGGCTGCTGCTTTACAGAAACTACGGGTGATCT 

+ + + + + + 

GCTACGACCCTTTCCGTGTCGTAATTGTCCGACGACGAAATGTCTTTGATGCCCACTAGA 



Hpyl8 8IX 
Dpnl 
Sau3AI 
BscGI 
Bsp24I 
Cjel 
CjePI 
I 



360 



361 



Faul NspV 

Taal TaqI 

Cjel Sthl32l| Bbvl I 

BsrI CjePI I Bsbl || Acelll | | 

HphI BciVI I Bsp24l| | SfaNI | j | Acil | | | CviRI 

III III I I II I III I 

GACATTTACTGGAAAGGGATACTCATTTTCATTCAACACGGTAGATGCGGGTTCGAATGC 

CTGTAAATGACCTTTCCCTATGAGTAAAAGTAAGTTGTGCCATCTACGCCCAAGCTTACG 



420 



CacBI 
Fnu4HI 
Taul 
Acil 
Fnu4HI 

Alul 
AlwNI 
BstAPI 

Cvi JI III I Cvi JI 

Mwol III I MboII I 

Tsel III I Mwol I I 

BsmI I I I I I Tthlllll I I I 

II II 

AGGAGCTGCGGCAAGCACAACTGCTGATAAAGCCCTAATCTTCACAGGATTTTCTAACCT 

421 + + + + + + 480 

TCCTCGACGCCGTTCGTGTTGACGACTATTTCGGGATTAGAAGTGTCCTAAAAGATTGGA 



UJ 
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481 



Eco57I 
ScrFI 
AlwNI 
EcoRII 
Alul 
CviJI 

Fnu4HI I j i jAcelll Rsal 
CviRI I j III Sfcl Hpyl78III Seal CviRI 

BsrDI Tselj I III Bbvl|TaaI | TatI | Msel Sfcl | 

I II I III II I I III II 

TTCCTTCATTGCAGCTCCTGGAACTACAGTTGCTTCAGGAAAAAGTACTTTAAGTTCTGC 

AAGGAAGTAACGTCGAGGACCTTGATGTCAACGAAGTCCTTTTTCATGAAATTCAAGACG 



540 



Msel 
Cjel I 

CviJI I I CjePI 
Nlaivj I I Hinfl | CviJI BsmAI 

PstI II II Tfil I Cjel I Maell BsmBI 

I II II I I II I I 

AGGAGCCTTAAATCTTACCGATAATGGAACGATTCTCTTTAGCCAAAACGTCTCCAATGA 

541 + + + + + + 600 

TCCTCGGAATTTAGAATGGCTATTACCTTGCTAAGAGAAATCGGTTTTGCAGAGGTTACT 



Dpnl 
Sau3AX 
CjePI 
Ecil I 

HphI I I I CjePI 
Acil| I I I Mbollj 

II I I I II 
AGCTAATAACAATGGCGGAGCGATCACCACAAAAACTCTTTCTATTTCTGGGAATACCTC 

601 + + + + + + 660 

TCGATTATTGTTACCGCCTCGCTAGTGGTGTTTTTGAGAAAGATAAAGACCCTTATGGAG 



Alul 
CviJI CjePI 
I I 



661 



Bbvl 

Sfcl Fnu4HI 

Mnll Bfal Tsp509I Dpnl | Alul Taul 

Earl I Spelj Hhal | Sau3AI | | CviJI Acil| 

II II I I I II I II 

TTCTATAACCTTCACTAGTAATAGCGCAAAAAAATTAGGTGGAGCGATCTATAGCTCTGC 

AAGATATTGGAAGTGATCATTATCGCGTTTTTTTAATCCACCTCGCTAGATATCGAGACG 



720 



BsrI 
CviJI 
Haelll 

CviRI Eael 
Fnu4HI I Gdill 
CviJI I I Mspl] 
Tselj I Hpyl78III BsrFI I I 
II I I III 



Cjel 
I 



GGCTGCAAGTATTTCAGGAAACACCGGCCAGTTAGTCTTTATGAATAATAAAGGAGAAAC 

721 + + + + + + 780 

CCGACGTTCATAAAGTCCTTTGTGGCCGGTCAATCAGAAATACTTATTATTTCCTCTTTG 
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Bsri 
Bmrl I 

I I 



Alul 
CviJI 
CacSI 
CviJI 

Banll Cjel | 

Bspl286I BseRl| I 
CviJI I CviJI I I I 

II I II I 



Mnll 
Acelll I 
Taqi I I 

I I I 



MboII 
Alul I 
CviJI I 



TGGGGGTGGGGCTCTGGGCTTTGAAGCCAGCTCCTCGATTACTCAAAATAGCTCCCTTTT 

781 + + + + + + 

ACCCCCACCCCGAGACCCGAAACTTCGGTCGAGGAGCTAATGAGTTTTATCGAGGGAAAA 



840 



Faul 
CacSI 
PstI 
Sthl32I 
CviRI 
Fnu4HI 
Sf cl 
AlwNI 



Tsel 
BstAPI 
TspRI I 
CviRI 



BtsI I 
Bbvl I I 
Hpyl78III SfaNI j j 

I III 



Mwol 



CviJI 
Haelll 
NlalV 
Sau96I I 
Acil I I 
Cjel I I I 
I I I I 



CTTCTCTGGAAACACTGCAACAGATGCTGCAGGCAAGGGCGGGGCCATTTATTGTGAAAA 

841 + + + + + + 900 

GAAGAGACCTTTGTGACGTTGTCTACGACGTCCGTTCCCGCCCCGGTAAATAACACTTTT 



Hin4I 

Cjel I EC057I 

Plel Cjel II Cjel I Bce83I 

BsmAI I Hinfl | j Hpyl78III Hpyl88IX | j MboII | 

I I I II I I II I I 

AACAGGAGAGACTCCTACTCTTACTATCTCTGGAAATAAAAGTCTGACCTTCGCCGAGAA 

901 + + + + + + 960 

TTGTCCTCTCTGAGGATGAGAATGATAGAGACCTTTATTTTCAGACTGGAAGCGGCTCTT 



i/ Fig.s e Cc o n't). 
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Dpnl 



Bglll 
BstYI 
Sau3AI 
Hpyl78III 
Bfal 
Bbvl 
Xbal 
Nlalll 
BsaJI 
Bspl286I 
BstDSI 
Ncol 
Styl 
Bmgl 



Earl 
Maelll Smll 



BseSI 
BstAPI I 
Mwol I 



Ecil 
Acil I 

I I II II 
CTCTTCAGTAACTCAAGGCGGAGCAATCTGTGCCCATGGTCTAGATCTTTCCGCTGCTGG 
961 + + + + + + 1020 

GAGAAGTCATTGAGTTCCGCCTCGTTAGACACGGGTACCAGATCTAGAAAGGCGACGACC 



CviJI 
Haelll 
Sau96I 
CacSI 
Fnu4HI 
MspAlI I 
Tsel I 
Acil I I 
I II 



Faul 
Sthl32l| Acil 
SfaNI I I Bbvl I 

I II II 



CacSI 
PstI 
CviRI 
Fnu4HI 
Sf cl 
Alul 
CviJI 
MspAlI 
PvuII 
Tsel 
I 



Fnu4HI 
Taul 
Acil |HaeII 
Mwol |HhaI I 
II II 



CCCTACCCTATTTTCAAATAATAGATGCGGGAACACAGCTGCAGGCAAGGGCGGCGCTAT 

1021 + + + + + + 1080 

GGGATGGGATAAAAGTTTATTATCTACGCCCTTGTGTCGACGTCCGTTCCCGCCGCGATA 



Plel 
Muni I 
Tsp509I I 
CviRI I j 
II I 



Dpnl 
BstYI 
Sau3AI 
Hpyl78III 
HaeIV| 
Hin4I I 
Hinf I I I 
I I I 



1081 



BsmAI 

Alwl CviRI I BsaJI 

Msel I BsmAI I |CjeI Maell Styl 

II I I I I II 
TGCAATTGCCGACTCTGGATCTTTAAGTCTCTCTGCAAATCAAGGAGACATCACGTTCCT 
+ + + + + ^ 1140 

ACGTTAACGGCTGAGACCTAGAAATTCAGAGAGACGTTTAGTTCCTCTGTAGTGCAAGGA 



Fig. 8 F(cQn't) 
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Bsbl 



Cjel 
I 



Mnll 
Hhal 
Thai I 
Mnll I I 
Acilll I 
III I 



TaqI 



Sau3AI 
ScrFI I 
BsaJl| I 
EcoRII I I I 
Bsmi III I 

I III I 



TGGCAACACTCTAACCTCAACCTCCGCGCCAACATCGACACGGAATGCTATCTACCTGGG 

1141 + + + + + + 1200 

ACCGTTGTGAGATTGGAGTTGGAGGCGCGGTTGTAGCTGTGCCTTACGATAGATGGACCC 



Bbvl 
CviJI 
Hael 
Haelll 
BsaJI 
Styl 
CviJI 



Dpnl 



Tsp509I 
Alwl I 



Fnu4HI 
Tsel 
CjePI 
Msel I 
Aflll| I 
Smllj I 
I I I 



Dpnl 
CjePI I 
Sau3AI I I 
Alwl I I I 

I III 



ATCGTCAGCAAAAATTACGAACTTAAGGGCAGCCCAAGGCCAATCTATCTATTTCTATGA 

1201 + + + + + + 1260 

TAGCAGTCGTTTTTAATGCTTGAATTCCCGTCGGGTTCCGGTTAGATAGATAAAGATACT 



Hpyl8 8IX 
Mae 1 1 
Hpyl88IX I 

EC057I Alul I I I Mspl 

HpylBSIX CviRI | Sf aNI CviJI | j | Bed BsaWI j Hin4I 

I I I I I I I I I II I 

TCCGATTGCATCTAACACCACAGGAGCTTCAGACGTTCTGACCATCAACCAACCGGATAG 

1261 + + + + + + 1320 

AGGCTAACGTAGATTGTGGTGTCCTCGAAGTCTGCAAGACTGGTAGTTGGTTGGCCTATC 



Bbvl 
PstI 

Alul CviRI 1 

Hpyl78III CviJI Sfcl | | 

I I I M 

CAACTCGCCTTTAGATTATTCAGGT^CGATTGTATTTTCTGGGGAAAAGCTCTCTGCAGA 

1321 + + + + + + 1380 

GTTGAGCGGAAATCTAATAAGTCCTTGCTAACATAAAAGACCCCTTTTCGAGAGACGTCT 



Fnu4HI 
Alul I 

CviJI I CviJI 
Mwol I Bf al I 

Mwol Tsel I Msel CviJI | j 

IN I III 

TGAAGCGAAAGCTGCTGATAACTTCACATCTATATTAAAGCAACCATTGGCTCTAGCCTC 

1381 + + + + + + 1440 

ACTTCGCTTTCGACGACTATTGAAGTGTAGATATAATTTCGTTGGTAACCGAGATCGGAG 



.Fig. 8 S (con't) 
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BpulOI 
Ddel 
Mnll 
Nlaiv I 
Hpyl78III I I 
I I I 



Taqi 



TGGAACCTTAGCACTCAAAGGAAATGTCGAGTTAGATGTCAATGGTTTCACACAGACTGA 

1441 + + + + + + 1500 

ACCTTGGAATCGTGAGTTTCCTTTACAGCTCAATCTACAGTTACCAAAGTGTGTCTGACT 



CviJI 
BseRI I 
I I 



ScrFI 
Mnll 
EcoRII 
CviRI I 
Nlalll j 
Eco57l| I 

I I I 



Alul 
CviJI 



Alul 
AlwNI CviJI 



AGGCTCTACACTCCTCATGCAACCAGGAACAAAGCTCAAAGCAGATACTGAAGCTATCAG 

1501 + + + + + + 

TCCGAGATGTGAGGAGTACGTTGGTCCTTGTTTCGAGTTTCGTCTATGACTTCGATAGTC 



1560 



Dpnl Ddel 
EC057I Sau3AI I Mnll | 

I II II 

TCTTACCAAACTTGTCGTTGATCTTTCTGCCTTAGAGGGAAATAAGAGTGTGTCCATTGA 

1561 + + + + + + 1620 

AGAATGGTTTGAACAGCAACTAGAAAGACGGAATCTCCCTTTATTCTCACACAGGTAACT 

CviJI 

Nlaiv| BseRI Mnll Mnll Acil 

II I III 

AACAGCAGGAGCCAACAAAACTATAACTCTAACCTCTCCTCTTGTTTTCCAAGATAGTAG 

1621 + + + + + + 1680 

TTGTCGTCCTCGGTTGTTTTGATATTGAGATTGGAGAGGAGAACAAAAGGTTCTATCATC 

Tsp509I ^ CviJI 

Fnu4HI I Fnu4HI | Bbvl 

Taul I CviJI CviJI Tsel | | Bbvl | 

II I I II I I I 

CGGCAATTTTTATGAAAGCCATACGATAAACCAAGCCTTCACGCAGCCTTTGGTGGTATT 

1681 + + + + + + 1740 

GCCGTTAAAAATACTTTCGGTATGCTATTTGGTTCGGAAGTGCGTCGGAAACCACCATAA 



TspRI 
Fnu4HI 
Tsel I 
BtsI I I 
Bbvl I I j 
I I II 



CacSI 
Bfal 
Nhel 
Fnu4HI 
Mwol I 
Tsel I 
Mwol I I 
I II 



Bpml 
Clal I 
TaqI I Hhal 



Rsal 
TatI I 
BsrI I I 



1741 



SfaNI TaqI | Hhal Bpml 

I I I 

CACTGCTGCTACTGCTGCTAGCGATATTTATATCGATGCGCTTCTCACTTCTCCAGTACA 
+ ^ ^ + + ^ 1800 

GTGACGACGATGACGACGATCGCTATAAATATAGCTACGCGAAGAGTGAAGAGGTCATGT 
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Mnll 

Sthl32I BscGl| BsmFI Bgll 

Hpyl78III I BslI || CjePI Cvi JI | Mwol 

I I I II I II I 

AACTCCAGAACCTCATTACGGGTATCAGGGACATTGGGAAGCCACTTGGGCAGACACATC 

1801 + + + + + + 1860 

TTGAGGTCTTGGAGTAATGCCCATAGTCCCTGTAACCCTTCGGTGAACCCGTCTGTGTAG 

BpulOI 

Hpyl78III Ddel 
CviRI I Maelll CviJI Hpyl78III 

CjePI I I Sthl32l| BscGI |BseMII | 

III II III I 

AACTGCAAAATCAGGAACTATGACTTGGGTAACTACGGGCTACAACCCTAATCCTGAGCG 

1861 + + + + + + 1920 

TTGACGTTTTAGTCCTTGATACTGAACCCATTGATGCCCGATGTTGGGATTAGGACTCGC 

Sthl32I 
Hinf I 
Tf il 
Fokl I 

Alul Hpyl78IIl|i I CjePI Bce83I 

CviJI CjePI III I SfaNI | Sfcl j 

I I III I II II 

TAGAGCTTCCGTAGTTCCCGATTCATTATGGGCATCCTTTACTGACATTCGCACTCTACA 

1921 + + + + + + 1980 

ATCTCGAAGGCATCAAGGGCTAAGTAATACCCGTAGGAAATGACTGTAAGCGTGAGATGT 

Nlalll 
Hpyl78III 
Real I 

Dpnl I I I Plel 
Sau3AI I I I I Smll Mnll j Hinfl Hpyl78III 

I II I I I III I 

GCAGATCATGACATCTCAAGCGAATAGTATCTATCAGCAACGAGGACTCTGGGCATCAGG 

1981 + + + + + + 2040 

CGTCTAGTACTGTAGAGTTCGCTTATCATAGATAGTCGTTGCTCCTGAGACCCGTAGTCC 

Apol Alul 

Tsp509I CviJI 

MboII I Tthlllll I 

AlwNI I I Hpyl8 8IX | | 

SfaNI I |XmnI Hpyl78III BsmI | Mmel | | 

I I I I I I I I I I 

AACTGCGAATTTCTTCCATAAGGATAAATCAGGAACTAACCAAGCATTCCGACATAAAAG 

2041 + + + + + + 2100 

TTGACGCTTAAAGAAGGTATTCCTATTTAGTCCTTGATTGGTTCGTAAGGCTGTATTTTC 

MboII 

Mbollj Alul 

Mmel Hpyl8 8IX| j CviJI 

CviJI Mnll I Beef I Eco57I ||| Eco57I TspRI I 

I I I I I III I II 

CTACGGCTATATTGTTGGAGGAAGTGCTGAAGATTTTTCTGAAAATATCTTCAGTGTAGC 

2101 + + + + + + 2160 

GATGCCGATATAACAACCTCCTTCACGACTTCTAAAAAGACTTTTATAGAAGTCACATCG 



MAR 0 8 Z004 J 
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Alul 
CviJI 
Cac8I I 
MboII I I 



Acelll 
Earl I 

Sapl I Hgal 

I I I I I I 

TTTCTGCCAGCTCTTCGGTAAAGATAAAGACCTGTTTATAGTTGAAAATACCTCTCATAA 

2161 + + + + + + 2220 

AAAGACGGTCGAGAAGCCATTTCTATTTCTGGACAAATATCAACTTTTATGGAGAGTATT 



Bf al 
Avrll I 
BsaJI I 
Styl I 

CviRI Taqi Mnll | 
Mnll Mwol |BspMI |BsmI | || Nlalll 

I I I II I I I 

CTATTTAGCGTCGCTATACCTGCAACATCGAGCATTCCTAGGAGGACTTCCCATGCCCTC 

2221 + + + + + + 2280 

GATAAATCGCAGCGATATGGACGTTGTAGCTCGTAAGGATCCTCCTGAAGGGTACGGGAG 



Mnll 
HphI I 
BslI I I 

I I I 



Nlalll 
MslI Nspl 



Alul 
CviJI 
Bpull02I 
Ddel 
Alul I 
CviJI I 
BsmI I I 

I I I 



Mnll 

I I 
ATTTGGAAGTATCACCGACATGCTGAAAGATATTCCTCTCATTTTGAATGCCCAGCTAAG 

2281 + + + + + + 2340 

TAAACCTTCATAGTGGCTGTACGACTTTCTATAAGGAGAGTAAAACTTACGGGTCGATTC 



Alul Smll 
CviJI Hpyl78III Alul I 

Sfcl I BciVI Bce83I | CviJI j CviJI 

II I I I II I 

CTACAGCTACACTAAAAATGATATGGATACTCGCTATACTTCCTATCCTGAAGCTCAAGG 

2341 + ^ + + + + + 2400 

GATGTCGATGTGATTTTTACTATACCTATGAGCGATATGAAGGATAGGACTTCGAGTTCC 



Fig. 8 4 (Gon't) 
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Dpnl 
BstYI 
Sau3AI 
Hpyl8 8IX 
Banll 



EC057I 
Avail I 
Sau96I I 
I I 



BsiHKAI 
Bspl286I 
Sad 
Alul 
CviJI 
Mnll 
CjePI 
Hpyl78III 
Bf al 
Banll 
Bspl286I 
Xbal 
CviJI I 
I I 



CviJI 
Alwl I 
I I 



CjePI 



CTCTTGGACCAATAACTCTGGGGCTCTAGAGCTCGGAGGATCTCTGGCTCTATATCTCCC 

2401 + + + + + + 2460 

GAGAACCTGGTTATTGAGACCCCGAGATCTCGAGCCTCCTAGAGACCGAGATATAGAGGG 



ScrFI 
BsaJI I 

EcoRII I I ScrFI Sf cl 

MboII Taal III Msel EcoRII | Accl| CviJI 

I I III I II III 

TAAAGAAGCACCGTTCTTCCAGGGATATTTCCCCTTCTTAAAGTTCCAGGCAGTCTACAG 

2461 + + + + + + 

ATTTCTTCGTGGCAAGAAGGTCCCTATAAAGGGGAAGAATTTCAAGGTCCGTCAGATGTC 



2520 



Bed 

Acil Sthl32I Eco57l| 

Fnu4HI Dral Haell BscGI | Bsal | j Bfal 

Taul Msel I HhaljCviJI | |BsmAI || BsaXI j 

I II II I I I I II II 

CCGCCAACAAAACTTTAAAGAGAGTGGCGCTGAAGCCCGTGCTTTTGATGATGGAGACCT 

2521 + + + + + + 2580 

GGCGGTTGTTTTGAAATTTCTCTCACCGCGACTTCGGGCACGAAAACTACTACCTCTGGA 

BsmI Hpyl8 8IX MboII 

I I I 

AGTGAACTGCTCTATCCCTGTCGGCATTCGGTTAGAAAAAATCTCCGAAGATGAAAAAAA 

2581 + + + + + + 2640 

TCACTTGACGAGATAGGGACAGCCGTAAGCCAATCTTTTTTAGAGGCTTCTACTTTTTTT 



Hpyl78III CviJI Sthl32I 

Tsp509I TaqI j Bfal j BsaBI HphI BscGI Thai 

III II II II 

TAATTTCGAGATTTCTCTAGCCTACATTGGTGATGTGTATCGTAAAAATCCCCGTTCGCG 

2641 + + + + + + 2700 

ATTAAAGCTCTAAAGAGATCGGATGTAACCACTACACATAGCATTTTTAGGGGCAAGCGC 



mo 9 2m 



^^^^ 



0^ Fig. 8 K (e on't ) 
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CviJI 

NlaIV| Mnll 
Rsal TspRI I Mnll Mwol 

I II I I 

TACTTCTCTAATGGTCAGTGGAGCCTCTTGGACTTCGCTATGTAAAAACCTCGCACGACA 

2701 + + + + + + 2760 

ATGAAGAGATTACCAGTCACCTCGGAGAACCTGAAGCGATACATTTTTGGAGCGTGCTGT 



Hinf I 
HpylSSIX 
Bed I 

Mwol Plel| I 

CviJI Ddel I CviJI | j | 

I II Mi I 

AGCCTTCTTAGCAAGTGCTGGAAGCCATCTGACTCTCTCCCCTCATGTAGAACTCTCTGG 

2761 + + + + + + 2820 

TCGGAAGAATCGTTCACGACCTTCGGTAGACTGAGAGAGGGGAGTACATCTTGAGAGACC 



Bbvl 
Mnll I 
Nlalll I I 
I II 



2821 



Fnu4HI 

Alul I Bpull02I 

CviJI I Alul Ddel RleAI Ddel 

Tselj CviJI CviJI I BseMII | Cvi JI | 

II I II II II 
GGAAGCTGCTTATGAGCTTCGTGGCTCAGCACACATCTACAATGTAGATTGTGGGCTAAG 
+ + + + + ^ 2880 

CCTTCGACGAATACTCGAAGCACCGAGTCGTGTGTAGATGTTACATCTAACACCCGATTC 



Apol 

Mnll EcoRI 
Bfal Mnll I Tsp509I 

I III 

ATACTCATTCTAGTTCCTACTTTCCTCCCTAAACTTTTAGGGAGGAATTCTTATAAAAAC 

2881 + + + + + + 2940 

TATGAGTAAGATCAAGGATGAAAGGAGGGATTTGAAAATCCCTCCTTAAGAATATTTTTG 

Tsp509I 

Hinf I Bfal Msel| 

Sfcl Tfil Msel Spell BsmAI Mnll || CviRI 

I I I II I I II I 

CCTGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGCAG 

2941 + + + + + + 3000 

GGACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTC 



0 « 
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Figure 9 A: Nucleotide and amino acid sequence of CPN 100708 
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ataaagtttt ttatatgaac aaaactttga cttcattggc agacacttcg accttaacag 60 

accaattttg ttgtcatccc tataaaaatc aggaattttc atg etc tec tea eta 115 

Met Leu Ser Ser Leu 
1 5 

ate cgt gat tea ttt ccc ctt ett att tta ett eec aea ttc eta geg 163 
lie Arg Asp Ser Phe Pro Leu Leu lie Leu Leu Pro Thr Phe Leu Ala 
10 15 20 

gca tta gga gee tec gta get ggc ggc gtt atg gga aec tat ate gtt 211 
Ala Leu Gly Ala Ser Val Ala Gly Gly Val Met Gly Thr Tyr lie Val 
25 30 35 

gta aaa egt att gtt tea att agt gga agt ata tet eat gea att eta 259 
Val Lys Arg lie Val Ser lie Ser Gly Ser lie Ser His Ala lie Leu 
40 45 50 

99^ 99a att gge ete aec eta tgg ata eaa tat aag ctt cat etc tet 307 
Gly Gly lie Gly Leu Thr Leu Trp lie Gin Tyr Lys Leu His Leu Ser 
55 60 65 

ttt ttc cet atg tat gga get att gta gga get att ttt eta get ett 355 
Phe Phe Pro Met Tyr Gly Ala lie Val Gly Ala lie Phe Leu Ala Leu 
70 75 80 85 

tgc ate gge aag ate eac etg aaa tac eaa gaa agg gaa gac tet ttg 4 03 
Cys lie Gly Lys lie His Leu Lys Tyr Gin Glu Arg Glu Asp Ser Leu 
90 95 100 

att geg atg att tgg tet gtg gge atg gca att gga att ata ttc att 451 
lie Ala Met lie Trp Ser Val Gly Met Ala lie Gly lie lie Phe lie 
105 110 115 

tec agg ctt ccc aec ttt aat gga gag etc ate aat ttt eta ttt ggg 499 
Ser Arg Leu Pro Thr Phe Asn Gly Glu Leu lie Asn Phe Leu Phe Gly 
120 125 130 

aae att etc tgg gte aec cet tea gac etc tat age tta gga ate ttt 547 
Asn lie Leu Trp Val Thr Pro Ser Asp Leu Tyr Ser Leu Gly lie Phe 
135 140 145 

gat ctt ett gtt tta gga att gtg gte ctt tgc eac aec egg ttc ett 595 
Asp Leu Leu Val Leu Gly lie Val Val Leu Cys His Thr Arg Phe Leu 
150 155 160 165 

get ctt tgc ttt gat gag agg tac acg get tta aae cat tgt tet gta 643 
Ala Leu Cys Phe Asp Glu Arg Tyr Thr Ala Leu Asn His Cys Ser Val 
170 175 180 



cag etg tgg tat ttc eta ett ett gtt etg aca gca ate acg att gtg 691 
Gin Leu Trp Tyr Phe Leu Leu Leu Val Leu Thr Ala lie Thr lie Val 
185 190 195 
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atg ttg att tat gtg atg gga acg att ctg atg ctt age atg etc gtc 73 9 
Met Leu lie Tyr Val Met Gly Thr lie Leu Met Leu Ser Met Leu Val 
200 205 210 

tta cct gtt get ata gcg tgt aga ttt teg tac aag atg aea cga att 787 
Leu Pro Val Ala lie Ala Cys Arg Phe Ser Tyr Lys Met Thr Arg lie 
215 220 225 

atg ttc ate teg gtc etc ttg aat ate tta tgt tct ttt tet gga att 835 
Met Phe lie Ser Val Leu Leu Asn lie Leu Cys Ser Phe Ser Gly lie 
230 235 240 245 

tgc ate gee tae tgt tta gat tte eca gta ggt cct acg ata tea ttg 883 
Cys lie Ala Tyr Cys Leu Asp Phe Pro Val Gly Pro Thr lie Ser Leu 
250 255 260 

ctg atg ggg tta ggt tat aca gcg agt ctt tgt gtg aag aag egg tae 931 
Leu Met Gly Leu Gly Tyr Thr Ala Ser Leu Cys Val Lys Lys Arg Tyr 
265 270 275 

aat eeg teg aeg eet tet cct gta agt cct gaa ate aat aca aat gta 979 
Asn Pro Ser Thr Pro Ser Pro Val Ser Pro Glu lie Asn Thr Asn Val 
280 285 290 

tagctaggga agcgcttttg gaagetttgg aggcattett eetgttegtc aggaagaaga 1039 

tcatcaattt tatttaaagc taccagcata tctttetttt eaaaatetgg ctgatgagag 1099 



t 



1100 
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Figure 10^: Restriction enzyme analysis of CPNl 00708 



Bcgl TaqI Msel 

I I I 

ATAAAGTTTTTTATATGAACAAAACTTTGACTTCATTGGCAGACACTTCGACCTTAACAG 
1 + + + + + ^ 50 

TATTTCAAAAAATATACTTGTTTTGAAACTGAAGTAACCGTCTGTGAAGCTGGAATTGTC 



BseRI 
Apol j 
Tsp509I 



Hpyl78III I iNlalll 



Mnll 



Tsp509I 
Bcgll 
Fokl I I 

III I I I I I 

ACCAATTTTGTTGTCATCCCTATAAAAATCAGGAATTTTCATGCTCTCCTCACTAATCCG 

TGGTTAAAACAACAGTAGGGATATTTTTAGTCCTTAAAAGTACGAGAGGAGTGATTAGGC 



120 



RleAI 
Fnu4HI 
Taul 

Hinfl Acil| 
Tf il Bfal I I 

I III 
TGATTCATTTCCCCTTCTTATTTTACTTCCCACATTCCTAGCGGCATTAGGAGCCTCCGT 

121 + + + + + + 

ACTAAGTAAAGGGGAAGAATAAAATGAAGGGTGTAAGGATCGCCGTAATCCTCGGAGGCA 



BsaXI 
CviJI I 
Nlaiv| I 
Mwol I I |MwoI 
I III I 



180 



Fnu4HI 
Taul 
Acil 
Cac8I 
Mnll 
Alul I 

CviJI I I I NlalV Maell Tsp509I 

I Ml I I I 

AGCTGGCGGCGTTATGGGAACCTATATCGTTGTAAAACGTATTGTTTCAATTAGTGGAAG 

181 + + + + + + 240 

TCGACCGCCGCAATACCCTTGGATATAGCAACATTTTGCATAACAAAGTTAATCACCTTC 



Bfal 
Mnll 
Tsp509I I 
CviRI I j 
Nlalllj I 
II 1 



CviJI 
Hael 

Haelll Alul 

HphI I CviJI 

Tsp509l| j BciVI Mnll Hindlll | 

II I I I II 



TATATCTCATGCAATTCTAGGAGGAATTGGCCTCACCCTATGGATACAATATAAGCTTCA 

241 + + + + + + 300 

ATATAGAGTACGTTAAGATCCTCCTTAACCGGAGTGGGATACCTATGTTATATTCGAAGT 



BsaXI Alul 
Alul I Alul CviJI 

BslI CviJI j CviJI Bfal | CviRI 

III I II I 
TCTCTCTTTTTTCCCTATGTATGGAGCTATTGTAGGAGCTATTTTTCTAGCTCTTTGCAT 
301 + + + + 4. + 360 

AGAGAGAAAAAAGGGATACATACCTCGATAACATCCTCGATAAAAAGATCGAGAAACGTA 



MAR 0 8 2004 



Fig. 10® (oon't) 
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Dpnl 
BstYI 
Sau3AI 
SfaNI 
Alwl I 

I I 



Bbsl 
Hinfl |MboII 



RleAI 
I 



Plel 
BslI I 

Mil.. 

CGGCAAGATCCACCTGAAATACCAAGAAAGGGAAGACTCTTTGATTGCGATGATTTGGTC 

GCCGTTCTAGGTGGACTTTATGGTTCTTTCCCTTCTGAGAAACTAACGCTACTAAACCAG 

Banll 
BsiHKAI 
Bspl286I 
Sad 
Alul I 
CviJI j 
Hin4I I 

I I 



420 



Tsp509I 
Muni I 
Tsp509I I 
Nlalll I I 



Msel 



421 



CviJI 
Cjel I 
ScrFI I 
EcoRII I I 

Ml I 

TGTGGGCATGGCAATTGGAATTATATTCATTTCCAGGCTTCCCACCTTTAATGGAGAGCT 
ACACCCGTACCGTTAACCTTAATATAAGTAAAGGTCCGAAGGGTGGAAATTACCTCTCGA 



480 



Hinfl 
Mnll 
BpulOI 

BstEII Ddel 
Cjel Maelll Alul | 

Cjel I HphI SimI Cjel Cvi JI | 

Tsp509I I I EC057I |Tsp45I HpylSBIX | Sf cl |||TfiI 

III II I I I I . Ill I 

CATCAATTTTCTATTTGGGAACATTCTCTGGGTCACCCCTTCAGACCTCTATAGCTTAGG 

481 + + + + + + 

GTAGTTAAAAGATAAACCCTTGTAAGAGACCCAGTGGGGAAGTCTGGAGATATCGAATCC 



540 



Sthl32I 
NlalV 
Mspl I 
Neil I 
ScrFI I 

III II II 

AATCTTTGATCTTCTTGTTTTAGGAATTGTGGTCCTTTGCCACACCCGGTTCCTTGCTCT 

541 + + + + + + 600 

TTAGAAACTAGAAGAACAAAATCCTTAACACCAGGAAACGGTGTGGGCCAAGGAACGAGA 



Dpnl 
Sau3AI I 
MboII I 



Avail 
Cjel 

Tsp509I Sau96I 



Mnll 
Cjel I 



Dral 
Msel I 
Rsal CviJI I I 

I III 



Alul 
CviJI 
MspAlI 
PvuII 
Rsal 
BsrGI I 
TatI j 
Bcefl I I 
I I I 



TTGCTTTGATGAGAGGTACACGGCTTTAAACCATTGTTCTGTACAGCTGTGGTATTTCCT 

601 + + + + + + 

AACGAAACTACTCTCCATGTGCCGAAATTTGGTAACAAGACATGTCGACACCATAAAGGA 



660 



Fig. 10 0 <con^t) 
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Hpyl88IX 
Hinfl I 
SfaNI I I 

Hpyl88IX Hpyl78III | Bcgl Bed |TfiI | 

I II I I I I I 

ACTTCTTGTTCTGACAGCAATCACGATTGTGATGTTGATTTATGTGATGGGAACGATTCT 

661 + + + + + + 720 

TGAAGAACAAGACTGTCGTTAGTGCTAACACTACAACTAAATACACTACCCTTGCTAAGA 



MslI 
Hpyl78III I 



Nlalll 
Nspl 
SphI 

Bpull02I CacSI I 

Ddel Mwol I I Bcgl Sfcl Rsal 

I III I I I 

GATGCTTAGCATGCTCGTCTTACCTGTTGCTATAGCGTGTAGATTTTCGTACAAGATGAC 

721 + + + + + + 780 

CTACGAATCGTACGAGCAGAATGGACAACGATATCGCACATCTAAAAGCATGTTCTACTG 

Hpyl7 8III Apol 
Taqll Avail | Tsp509I 

Tsp509I I Sau96I I Mnll Hpyl78III | CviRI 

II III III 

ACGAATTATGTTCATCTCGGTCCTCTTGAATATCTTATGTTCTTTTTCTGGAATTTGCAT 

781 + )- + + + + + 840 

TGCTTAATACAAGTAGAGCCAGGAGAACTTATAGAATACAAGAAAAAGACCTTAAACGTA 



841 



BsrDI 
EcoRV 
Pf 111081 
Avail 
ECO0109I 
PspSII 
Sau96I 
Sse8647I 
Bcgl I 
Taal Bmrl | | 

SfaNI I Bsrl|| | III Bed Bcgl 

II III I I I I I I 
CGCCTACTGTTTAGATTTCCCAGTAGGTCCTACGATATCATTGCTGATGGGGTTAGGTTA 
^ ^ ^ ^ + ^ 

GCGGATGACAAATCTAAAGGGTCATCCAGGATGCTATAGTAACGACTACCCCAATCCAAT 



900 



BsaXI 
Hin4I 
BsaHI 
Hindi 
MboII Accl| 
Rsal I TaqI | 

Hinfl Plel Acil | | Sail | | | ||HgaI Hpyl78III 

I I I I I III I II I I 

TACAGCGAGTCTTTGTGTGAAGAAGCGGTACAATCCGTCGACGCCTTCTCCTGTAAGTCC 

901 + + + + + + 960 

ATGTCGCTCAGAAACACACTTCTTCGCCATGTTAGGCAGCTGCGGAAGAGGACATTCAGG 



Fig. 10 0) ^on't) 
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Haell Alul 
Bf al Hhal I Cvi JI BsmI 

Alul I Eco47IIl|| Mnll| Mwol | 

CviJlj Mwol I I iHindlll I |MboII I j 

II I III III II I 

TGAAATCAATACAAATGTATAGCTAGGGAAGCGCTTTTGGAAGCTTTGGAGGCATTCTTC 

961 + + + + + + 1020 

ACTTTAGTTATGTTTACATATCGATCCCTTCGCGAAAACCTTCGAAACCTCCGTAAGAAG 

MboII 



MboII 
Tsp509I 
Dpnl I 
Hpyl78III Sau3AI | j 
I III 



Alul 
CviJI 
Dral I 
Msel| I 
II I 

CTGTTCGTCAGGAAGAAGATCATCAATTTTATTTAAAGCTACCAGCATATCTTTCTTTTC 

1021 + + + + + + 1080 

GACAAGCAGTCCTTCTTCTAGTAGTTAAAATAAATTTCGATGGTCGTATAGAAAGAAAAG 

CviJI 
I 

AAAATCTGGCTGATGAGAGT 

1081 + + 1100 

TTTTAGACCGACTACTCTCA 



